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Abstract 

We discuss several well known results about Schur functions that can be proved using cancellations in 
alternating summations; notably we shall discuss the Pieri and Murnaghan-Nakayama rules, the Jacobi- 
, Trudi identity and its dual (Von Nagelsbach-Kostka) identity, their proofs using the correspondence with 

lattice paths of Gessel and Viennot, and finally the Littlewood-Richardson rule. Our our goal is to show 
that the mentioned statements are closely related, and can be proved using variations of the same basic 
technique. We also want to emphasise the central part that is played by matrices over {0, 1} and over N; 
we show that the Littlewood-Richardson rule as generalised by Zelevinsky has elegant formulations using 
either type of matrix, and that in both cases it can be obtained by two successive reductions from a 
large signed enumeration of such matrices, where the sign depends only on the row and column sums 
of the matrix. 
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§0. Introduction. 

Many of the more interesting combinatorial results and correspondences in the basic theory of symmetric 
f~i ' functions involve Schur functions, or more or less equivalently the notions of semistandard tableaux 

"j^ I or horizontal strips. Yet the introduction of these notions, in any of the many possible ways, is not 

very natural when considering only symmetric functions, cf. [Stan, 7.10]. One way the importance of 
Schur functions can be motivated is by representation theory: interpreting symmetric functions in the 
representation theory either of the symmetric groups or the general linear groups, the Schur functions 
(N ■ correspond to the irreducible representations. However, there is another way of motivating it: if one 

broadens the scope slightly from symmetric functions to alternating polynomials, then Schur functions 
do arise quite naturally as quotients of alternants. It is this point of view, which could also be reached 
' from representation theory if use is made only of Weyl's character formula, that we shall take in this 

I paper; from this perspective it is not so surprising that proofs of basic identities involving Schur functions 

' should involve alternating summations and cancellations. 

The main point we would like to make in this paper is that the use of the definition of Schur functions 
as quotients of alternants can be limited to the deduction of a single simple formula (lemma 2.2), which 
i^H I describes the multiplication by an arbitrary symmetric function in the basis of Schur functions; after 

■ this, alternating polynomials need not be considered any more. In general the formula produces an 
^ I alternating sum of Schur functions; in various particular cases, one can obtain classical results from it (the 

Pieri and Murnaghan-Nakayama rules, Jacobi and Von Nagelsbach-Kostka identities, and the Littlewood- 
Richardson rule) by judiciously applying combinatorially defined cancellations. Our presentation is nearly 

■ self-contained, but we do use an enumerative identity that follows from the RSK-correspondence; we shall 
' omit its well known elementary combinatorial proof, which is not directly related to the theme of this 

paper. 

Our paper is structured as follows. In §1 we give the basic definitions concerning symmetric functions, 
alternating polynomials and (skew) Schur functions. In §2 we introduce our basic lemma, and its most 
elementary applications giving the Pieri and Murnaghan-Nakayama rules. In §3 we first establish the 
duality of the bases of complete and minimal symmetric functions (this is where the RSK-correspondence 
is used). This allows us to interpret (skew) Schur functions as generating series of semistandard tableaux 
(which elsewhere is often used as their definition) , and to deduce the Cauchy, Jacobi and Von Nagelsbach- 
Kostka identities. In §4 we discuss cancellations defined for intersecting families of lattice paths, in the 
style of Gessel and Viennot, and relate them to identities derived from the Pieri rules. These consid- 
erations lead to natural encodings of families of lattice paths, and of the semistandard tableaux that 
correspond to non-intersecting families of paths, by matrices with entries in N or in {0, 1}; these encod- 
ings are also important in the sequel. In §5 we give a final application of our basic lemma to derive the 
Littlewood-Richardson rule. Formulating (Zelevinsky's generalisation of) that rule in terms of binary or 
integral matrices reveals in both cases an unexpected symmetry. We also exhibit an equally symmetrical 
doubly alternating expressions for the same numbers, in which no tableaux appear at all. We close by 
raising a question inspired by these expressions, which will be taken up in a sequel to this paper. 
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1 Preliminaries and definitions 

§1. Preliminaries and definitions. 

Studying symmetric functions involves the use of various combinatorial objects; we start with some 
general considerations concerning those. We shall make much use of sequences (vectors) and matrices, of 
which the entries will almost always be natural numbers. In some cases the entries are restricted to be 
either or 1, in which case we shall reefer to the objects as "binary" . While all objects we shall encounter 
can be specified using finite information, we shall consider vectors and matrices as associations of entries 
to indices, without restricting those indices to a finite set (just like for polynomials one usually does 
not give an a priori bound for the degrees of their monomials). Thus vectors and matrices are "finitely 
supported" , in that the entries are zero outside a finite range of indices; finite vectors and matrices 
are identified with infinite ones obtained by extension with null entries. This convention notably allows 
addition of vectors or matrices without concern about their sizes. 

When displaying matrices we shall as usual let the first index increase downwards and the second to 
the right, and the same convention will be used whenever subsets of N x N are displayed, such as Young 
diagrams (in the sequel to this paper we shall in fact encounter Young diagrams in the role of subsets of 
indices in matrices). Some objects, notably tableaux, are defined as sequences of vectors; in this case the 
indices for the sequence are written as parenthesised superscripts to avoid confusion with the subscripts 
indexing individual vectors. 

We always start indexing at 0, in particular this applies to sequences, rows and columns of matrices 
and tableaux, and entries of tableaux. Hence in the situation where a sequence of objects is determined by 
the intervals between members of another sequence (such as horizontal strips in a semistandard tableau, 
which are given by successive members of a sequence of shapes), the index used for an interval is the same 
as that of the first of the members bounding it. Our standard n-element set is [n] = { z S N | i < n}. 
For the set theoretic difference S\T we shall write S — T when it is known that T C S. 

We shall frequently use the "Iverson symbol" : for any Boolean expression condition one puts 



This notation, proposed in [GKP, p. 24], and taken from the programming language APL by K. Iverson, 
generalises the Kronecker delta symbol: instead of Sij one can write [i = j] ■ Among other uses, this 
notation allows us to avoid putting complicated conditions below summations to restrict their range: it 
suffices to multiply their summands by one or more instances of [ condition ] . By convention, in a product 
containing such a factor, the factors to its right are evaluated only if the condition holds; if it fails, the 
product is considered to be even if some remaining factor should be undefined. 

1.1. Compositions and partitions. 

The most basic combinatorial objects we shall use are finitely supported sequences of natural numbers 
a = (ai)igN- The entries are called the parts of a, and the main statistic on such sequences is the sum of 
the parts, written |a| = J2ieN "^^^ systematic name for such sequences a with |a| = d would be infinite 
weak compositions of d, but we shall simply call them just compositions of d. The set of compositions of d 
will be denoted by Cd (this set is infinite when d> 0), and C = UdeN denotes the set of all compositions. 
In order to denote specific compositions, we shall specify an initial sequence of their parts, which are 
implicitly extended by zeroes. When the parts of a composition are restricted to lie in {0, 1} = [2], it will 
be called a binary composition; we define Cl'f = { a G Cd | Vi G N: e [2] } and C'^' = UdGN^d'- Binary 
compositions of d correspond to d-element subsets of N, while arbitrary compositions of d correspond to 
multisets of size rf on N. Among other uses, compositions parametrise monomials; if Xn = {Xi | i G N } 
is a countable set of commuting indeterminates, then the monomial HiGN -^i"^ '^'^^^ denoted by X". 

We shall consider permutations of indeterminates, and correspondingly of the parts of compositions. 
The group that acts is the group Sqc of permutations of N that fix all but finitely many numbers. The 
permutation a € Soo acts by simultaneously substituting Xi := ^^^(i) for all indeterminates, and therefore 
operates on compositions by permuting their parts: a{a) = (ao.-i(i))jeN- Obviously \cr{a)\ = \a\, and 
the orbit of a contains a unique composition whose parts are weakly decreasing, which will be denoted 
by a"'"; for instance for a = (0,5,2,0,0,1,7,0,2) one has = (7,5,2,2.1). For cZ G N we define the 
finite set Pd = { A G Q | Vi G N: Aj > A^+i }, whose elements are called partitions of d; then a+ G Vd 
for any a G Cd- We also put V = UdeN^'i- binary compositions of d form a single orbit under 
permutations of their parts, so there is just a single binary partition of d: it is the partition [i G [d] ] 
whose d initial parts are 1 and the rest 0, and we shall denote it by l^''^ 
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1.1 Compositions and partitions 



Wc shall usually denote compositions by Greek letters a, /3, . . ., but for partitions we use Greek letters 

further on in the alphabet: A, yit, v, and sometimes k. Apart from listing its nonzero parts, a partition 

X €V can also be specified by drawing its diagram [A] = { (i, j) G | j G [Aj] }. Elements of the diagram 

are drawn (and usually referred to) as squares, so that for instance the diagram of A = (7, 5, 2, 2, 1) would 

be drawn as 

I I I I I I I I 



The transpose partition of A G "P, which will be denoted by A', is the one whose parts give the lengths of 
the columns of [A], so that [A*] is the transpose diagram [A]*; one has A*- = ^{ j g N | j e [A^] }. 

We shall be considering several relations defined between partitions; we collect their definitions here. 
The most fundamental relation is the partial ordering 'C' defined by inclusion of diagrams: /z C A means 
that [fj] C [A] or equivalently that fii < \i for all i G N. Note that if ;U C A then A — /x and A* — ^* are 
compositions. The relation 'C' will be used mostly implicitly via the notion of a skew shape A//z, which 
denotes the interval from /x to A in the poset {V, C); the corresponding skew diagram is [A//x] = [A] — [fi], 
and we define |A//x| = |A| — Several relations refining 'C' will be used; for the ones in the following 
definition it will be convenient to define them on the set of all compositions, although they will never 
hold unless both arguments are actually partitions. 

1.1.1. Definition. Tiie relations and on C are defined as follows. To have either (j, ^ \ or 

H \— \, it is necessary that A//i be a sJcew shape (in other words X, fi E V, and /i C Aj. If this is the case, 
then /X ^ A holds if and only if X — n gC^^\ in which case A//z is called a vertical strip; similarly ^ A 
holds if and only if Aj+i < Hi < Xi for all i G N, in which case X/fi is called a horizontal strip. 

Note that the final condition for ^ A already implies that A//x is a skew shape; in addition it 
means that [X/ fi] has at most one square in any column. Similarly, for a skew shape A//i, the condition 
/U ^ A means that [X/h] has at most one square in any row. Therefore /x ^ A is equivalent to /x* ^ A* 
when XjH^V. To denote the opposite relations we shall rotate rather than reflect the symbol, so A — ^ /x 
means the same as /i ^ A, while A ^ /i means the same as fi ^ X. 

We illustrate concrete instances of these relations graphically by superimposing the contours of the 
diagrams of the two partitions involved: 



(7, 5, 2, 2, 1)^(8,6,3, 3, 1,1,1): 



(7,5,2,2,1)^(11,6,4,2,1,1): 




For the following definition we use the partitioning of N x N into diagonals Dd, for d G Z: 

Dd = {{i,j)G'N^\j-i = d} fordGZ. (1) 

1.1.2. Definition. For k > 0, a relation '-<r(fe) ' on V is defined as follows: /x -<r(fc) A means that A//x 

is a skew shape with |A//i| = k, for which the k sciuarcs of [X/ fi] lie on k consecutive diagonals. In this 
case we call the shape A//U a k-ribbon. The height ht(A//x) of a k-ribbon X/h is the difference between 
the initial (row) coordinates of the squares of [A//x] on the first and the last of those k diagonals. 

Again we give a graphic illustration in the same style as before, for X/iJb= (7, 6, 6, 3, 3, 2)/(7, 5, 2, 2, 1): 



(7,5,2,2,1) ^,(10) (7,6,6,3,3,2): 




One has [A//x] = {(5, 0), (5, 1), (4, 1), (4, 2), (3, 2), (2, 2), (2, 3), (2, 4), (2, 5), (1, 5)}, which diagram has its 
squares on the 10 consecutive diagonals Dj, for — 5 < < 5; moreover, we see that ht(A//x) = 5 — 1=4. 
Finally we shall need the dominance partial ordering on each set Vd separately. 

1.1.3. Definition. For any fixed d G N a relation '< ' on Vd, called the dominance ordering, is defined 
by jji <X if and only if for every fc G N one has J2ie[k] l^i — Sie[/c] ^i- 
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1.2 Matrices and tableaux 
1.2. Matrices and tableaux. 

Wc shall use the two-dimensional counterparts of compositions: finitely supported matrices with entries 
in N. Like for compositions the binary case, where entries are restricted to [2] = {0, 1}, will be of special 
interest. The statistic given by the sum of all entries can be refined by taking sums separately either of 
rows or of columns; in either case the result is a composition. 

1.2.1. Definition. Let Ai denote the set of matrices (Afi.j)i,jgN with entries Mi,j in N, of which only 
finitely many are nonzero, and let A4'^' denote its subset of binary matrices, those of which all entries lie 
in {0, 1}. Let row: — > C be the map M {J^je-yi -^j,j)ieN that takes row sums, and col: A4 ^ C the 
map M (X^igN ^^i,i)jGN that takes column sums; put Ma,0 = {M £ A4 \ row(M) = a, col(M) = (3} 

and M^'^^p = Majj n 'x^"' for a, /3 G C. 

These matrices can be used to record sequences of (binary) compositions with finite support, either 
by rows or by columns. We shall denote row i of M by Mj, and column j by M|. We shall also need 
sequences of partitions, but these will be subject to the condition that adjacent terms differ by horizontal 
or vertical strips, and the condition of finite support is replaced by the sequence becoming ultimately 
stationary. This gives rise to the notion of semistandard tableau, and some variants of it. 

1.2.2. Definition. Let X/n be a skew shape, and a € C. A semistandard tableau of shape X/fi and 
weight a is a sequence of partitions (A(*^)igN with X^^^ -r- X^^'^^^ and jA^'+^V'^^'^l = ^or all z S N, 
A(o) = fx, and X'-^^ = X for any N that is so large that a, = for all i > N. The weight of a tableau T 
is denoted by wt{T), and the set of all semistandard tableaux of shape X/fi by SST(A/^); we also 
put SST(A//x,Q!) = {T e SST(A/y^) | wt(T) = a}. A transpose semistandard tableau of shape X/ fjb and 
weight a is a sequence of partitions defined similarly, with A^*^ ^ A^*"'""'^) replacing A^*) ^ A*^*+^) . 

Wc shall reserve the qualification "Young tableau" to the case fi = (0), in which case A//i will 
be abbreviated to A in the notations just introduced. There are maps from semistandard tableaux to 
transpose semistandard tableau and vice versa, defined by transposing each partition in the sequence; 
under these maps the shape of the tableau is transposed while the weight is preserved. Another variation 
on the notion of semistandard tableau is to replace the relations A^*) A(*+^^ or A^^^ ^ A^*"*"^) by their 
opposite relations A*^*^ ^ A(*+^^ respectively A^*^ A*^*+^). This gives the notions of reverse (transpose) 
semistandard tableaux, which will occur in the sequel to this paper; their shape X/n and weight a are 
such that the sequence starts at A = A^"^ and ultimately becomes /x, while lA'^'V'^'^'"'^^^! — fo'" ^ i € N. 

The traditional way to display a semistandard tableau is to draw the diagram of its shape filled with 
numbers, which identify for each square the horizontal strip to which it belongs. We shall label with an 
entry i the squares of [A*^'+^^ /A*-*^]. The entries will then increase weakly along rows, and increase strictly 
down columns, and for this reason semistandard tableaux are also called column-strict tableaux (and 
transpose semistandard tableaux are then called row-strict tableaux). Thus the semistandard tableau 
T = ((4, 1) ^ (5, 2) ^ (5, 3, 2) ^ (6, 3, 3, f ) ^ (6, 4, 3, 2) ^ (7, 5, 4, 3) ^ (9, 5, 5, 3, 1) ^ (9, 8, 5, 5, 3) ^ 
(9, 8, 5, 5, 3) . ) , which is of shape (9, 8, 5, 5, 3)/(4, 1) and weight (2, 3, 3, 2, 4, 4, 7), will be displayed as 
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(2) 



More important in our paper than this display will be two ways of representing tableaux by matrices. 
Simply recording the partitions forming a tableau T in the rows or columns of a matrix does not give a 
finitely supported matrix, but we can obtain one by recording the differences between successive partitions. 
We shall call the matrix so obtained an encoding of T, but one should realise that decoding the matrix 
to reconstruct T requires knowledge of at least one of the partitions forming the (skew) shape of T. 
Various ways are possible to record horizontal strips A^'+^'^^/A'^*^: one may either record the differences 
A(»+i) _ xM g C or the differences between the transpose shapes (A(*+^^)* — (A^*))* G C'^', and one 
may record these compositions either in the rows or the columns of the matrix. Prom the four possible 
combinations we choose the two for which one has a correspondence either between the rows of the tableau 
and the rows of the matrix, or between the columns of the tableau and the columns of the matrix. 
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1.3 Symmetric functions 



1.2.3. Definition. Let T = (A'^'^)jgN be a scmistandard tableau. The integral encoding of T is the 
matrix M e M dehned by Mi^ — (A*^-'+^' — A^-^^)^, and the binary encoding ofT is the matrix M' e A^'^' 
defined by - = ((A(*+^))* — (A'*^)*)j. The sets of integral and binary encodings of semistandard 
tableaux T e SST(A//i) will be denoted by Tabl(A/z/) and Tabl'''(A//i), respectively. 

For instance for the tableau T of (2), one finds the integral and binary encodings 



M = 



/I 1 1 2 0\ 
110 110 3 
2 10 110 
1110 2 

\0 1 2/ 



and 



M' = 



^0 1 1 On^ 

1 1 1 
101001000 

1 1 

1110 10 

1 1 1 1 
\o 1 1 1 1 1 1 1 0/ 



(3) 



which finite matrices must be thought of as extended indefinitely by zeroes. To reconstruct from cither 
of these matrices the tableau T or the other matrix, one must in addition know at least that jj, = (4, 1) 
or that A = (9, 8, 5, 5, 3) for the shape A//i of T. Each entry Mij counts the number of entries j in row i 
of the displayed form of T, while entry ^ coimts the number (at most one) of entries i in column j. 
Therefore the row Mj records the weight of row i of the display of T, while the column (M')*- records the 
weight of its column j. One has row(M) = A - /x, col(M') = A' - /x*, and col(M) = row(M') = wt(T). 

1.3. Symmetric functions. 

There are several equivalent ways to define the ring A of symmetric functions. Following [Stan] , we shall 
realise A as a subring of the ring Z[[Xn]] of power series in infinitely many indeterminatcs. The elements / 
of this subring are characterised by the fact that the coefficients in / of monomials AT", X^^ arc the same 
whenever = /J"*" (so / is stable under the action of Soo), and that the degree of monomials with 
nonzero coefficients in / is bounded. Elements / G A are called symmetric functions. Since the indicated 
subring of Z[[ArN]] is just one realisation of A, we make a notational distinction between occurrences of 
a symmetric function / that are independent of any realisation of A (for instance in identities internal 
to A), and occurrences where the realisation inside Z[[ArN]] is essential (because indeterminatcs Xi occur 
explicitly in the same equation); in the latter case we shall write /[A^n] instead of /. If the nonzero 
coefficients of /[^n] only occur for monomials of degree d, then / is called homogeneous of degree d; due 
to the required degree bound, this makes A into a graded ring. 

Another realisation of A is via its images in polynomial rings in finite sets of indeterminates. This 
is for instance the point of view taken in [Macd]; for us this realisation is important in order to be able 
to consider alternating expressions, which is hard to do for infinitely many indeterminates. For any 
n G N, let ATjn] = { AT^ | i G [n] } be the set of the first n indeterminates. There is a ring morphism 
Z[[Xn]] Z[[X[n]]] defined by setting Xi := for all i > n, and the image of the subring A under this 
morphism is the subring of the symmetric polynomials in Z[Ar[„]], those invariant under all permutations 
of the indeterminates; we shall denote this image by A[„]. For / G A, the image in A[„] of /[ATn] G Z[[ArN]] 
will be denoted by /[Ar[„]]. Thus each / G A gives rise to a family (/[X[„]])„gN of elements G A[„] 

of bounded degree, which family is coherent with respect to the projections A[„_|_i] A[„] defined by the 
substitution X„ := 0. We shall write this final property as /[-'^[n+i]] [Ar„ := 0] = /[Ar[„]] for all n G N. 
Conversely each family (/„)„gN with /„ G A[„] for all n G N that satisfies /„+i[Ar„ := 0] = /„ for all n, 
and for which deg /„ is bounded, forms the set of images of a unique element / G A. In other words, one 
can realise A as the inverse limit in the category of graded rings of the system (A[„])„gN relative to the 
given projections A[„+i] A[„]. 

For any a G Vd, the sum [ATn] = X]/3eCd [ '^^ ~ '^^ ] distinct monomials in the permuta- 

tion orbit of AT" is a symmetric function. Since no nonempty proper subset of its nonzero terms defines 
a symmetric function, we shall call a minimal symmetric function (we avoid the more traditional 
term "monomial" symmetric function since the set of all is not closed under multiplication). The set 
{ m\ I A G T'd } is a basis of the additive group of homogeneous symmetric functions of degree d. 

The elementary symmetric functions 64 for d G N are instances of minimal symmetric functions: 
they are defined as = m^d) . One can write more explicitly 



N 



[2] 



...,id6N 



<id]Xi^--- AT,, 



(4) 
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1-4 Alternating polynomials and Schur functions 



The complete (homogeneous) symmetric functions hd for d G N arc defined by hj, = "^x^-p^fnx- 
the elementary symmetric functions, they can be written more explicitly 



Like 



(5) 



n,...,ideN 



The power sum symmetric functions Pd for d > are defined by Pd = m(^d) > so Pd[-'^N] = X^ieN -^i- These 
families of symmetric functions have the following generating series, expressed in Z[[Xn,T]]. 



J2ed[X^]T''=l[il+XiT), 

deN iGN VfcGN / 

^Pk[x^]T' = J2 [T.(^^Tn 

k>0 iGN \fe>0 / 



iGN 



X.T' 



iGN * 



(6) 
(7) 
(8) 



For any a € C we define — JlieN "^"i ^"^^ ~ IlieN ^"i ! since cq = /lo = 1 the infinite products 
converge, and it is clear by commutativity that Cq, = eQ,+ and ha = The products and ha 

can be expanded into monomials combinatorially, in terms of binary respectively integral matrices: by 
multiplying together copies of the first equality in (4) respectively in (5), one finds 



(9) 
(10) 



We can obtain generating scries in Z[[Xn, Yn]] in which all or all hfj appear, cither from the preceding 
equations, or by substituting T := Yj into copies of (6) or (7) for j £ N and multiplying them, giving 



l3eC MGMl^l ijGN 



)1(M) 



i,jGN 



XiYj 



(11) 
(12) 



1.4. Alternating polynomials and Schur functions. 

Now fix n e N, and let A[n] denote the additive subgroup of Z[X[„]] of alternating polynomials, i.e., of 

polynomials p such that for all permutations o" 6 S„ the permutation of indetcrminatcs given by cr operates 
onp as multiplication be the sign e{a). Multiplying an alternating polynomial by a symmetric polynomial 
gives another alternating polynomial, so if we view Z[X[„]] as a module over its subring A[„], then it 
contains ^[„] as a submodulc. Like for symmetric polynomials, the condition of being an alternating 
polynomial can be expressed by comparing coefficients of monomials in the same permutation orbit: a 
polynomial X^^g^n c^X" is alternating if and only if for every a € N" and ct G S„ one has c^ a = s{a)ca- 
In particular this implies that Cq, = whenever a is fixed by any odd permutation, which happens as 
soon as aj = aj for some pair i ^ j- In the contrary case, a is not fixed by any non- identity permutation, 
and the alternating orbit sum Oa[X[„]] = Xo-gs e(f )-^'^'" is an alternating polynomial that is minimal 
in the sense that its nonzero coefficients arc all ±1 and no nonempty proper subset of its nonzero terms 
defines an alternating polynomial. The element aQ[X[„]] is called an alternant, and can be written as a 
determinant 



a„[X[„]]=det(x;0. 



x° 



x^ 



(13) 



-1 n—l n—1 

we define aa[X[„]] by the same expression even when a is fixed by some transposition, but in that case it 
is 0. The set of alternants generates A^n] as an additive group, but to obtain a Z-basis one must remove 
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1-4 Alternating polynomials and Schur functions 



the null alternants, and for all other orbits of compositions choose one of the two opposite alternants 
associated to it. Thus one finds the Z-basis { aa[X[„]] | a £ N"; ao > ■ ■ ■ > a„_i } of Our convention 
of interpreting finite vectors by extension with zeroes as finitely supported ones, allows us to view N" as 
a subset of C. Then putting (5„ = (n — 1, n — 2, . . . , 1, 0) € N" the above basis of can be written as 

{a5„+A[^[n]] I AeT'nN"}. 

Put An = as^[X^nj]; in other words, A„ € A[n] is the Vandermonde determinant, which evaluates 
to Ylo<i<j<ni-^i ~ ^j)- Alternating polynomials are all divisible by each factor Xj — Xj, and therefore 
by A„,. So viewing A[n] as an A[„]-module, it is cyclic with generator A„. The map A[„] A[n] of 
multiplication by A„ is a Z-linear bijection, so one can apply its inverse to the basis of consisting of 
elements a5„_|_A[Ar[„]]. Thus defining 

Sa[A-[„]J - -r e A[„] (14j 

for A e P with A„ = 0, the set { SA[A^[n]] | A S "P; A„ = } forms a Z-basis of A[„]. It is useful to define 
Sa[X[„]] for arbitrary a G N" by the same formula. Doing so does not introduce any new symmetric 
functions, since one has Sq[X[„]] = unless the n components of (S„ + a are all distinct, and in that case 
one has Sq,[X[„]] = ssaIATj^]], where X € V with A„ = is determined by the condition ((5„ + a)^ = i5„ + A, 
and £ is the sign of the (unique) permutation ct e S„ such that <7{6n + a) = (5„ + A. Finally we extend 
this definition to any (infinite) composition a G C, by defining s„[X[„]] = whenever a ^ N". 
Here are some examples illustrating these definitions. One has 



5(3,1) [A^[2]] 



^4,1)[ 



)[^[2]] _ Xq^I - XpXf _ I V2 y2 , y y3 

X — Y V ~ -^0^1 + -^0^1 + -^0-^1 

A2 Aq — Ai 

'7^(3,1) [-^^[2]] + TO(2,2)[X[2]] 



, _ a(2,2)[A:[2]] 

S(l,2)lA[2]J - — = 

, 1 «(1.4)[-^[2]] XpXf X^X, 

S(0,4)[A[2]J = ^- = _ = -S(3,i)[A[2]J 



S(3,l)[-'^[3]] 



a(5,2,o)[^[3]] ^ XlXj - XgXf - X^Xj + XfXj + XjXl ~ X^Xj 
As {Xo - Xi){Xo - Xi){Xi - X2) 

= XqXi + XqX2 + XqX^ + 2XqX\X2 + XqX| 

+ J'^o^i + 2XqX'^X2 + 2X0X1X2 + A'oA'2 + ^1^2 + -^1-^2 "I" -^1-^2 
= '^(3,l)[-'^[3]] + m(2,2)[X[3]] + 2TO(2,1,i)[X[3]] 

1.4.1. Proposition. For aJJ n G N and all a G C one has Sa[X[„_|_i]][X„ := 0] = Saf-'^in]]. 

For instance one sees in the example above that S(3,i) [Xpj] [X2 := 0] = S(3,i) [-^[2]]^ from the definition 
one has ma[X[„_|_i]][X„ := 0] = ma[X[„]] for all a G C, while ma[X[„]] = unless a G N" C C, so in 
particular J72(2.i,i) [^^[2]] = 0. 

Proof. The value Sa[X[„_(.i]][X„ := 0] can be computed by applying the substitution X„ := separately 
to the numerator a5„_|_^+Q,[X[„_|_i]] and the denominator A„+i in the definition of Sq[X[„+i]], provided that 
the latter substitution yields a nonzero value; this is the case since A„+i[X„ 0] = Xo ■ ■ ■ X„_iA„. We 
may assume that a G N""*"^ holds, since otherwise both Sq,[X[„+i]] and Sq,[X[„]] are zero by definition. Now 
put l3 = Sn+i + a = (n + ao, . . • , 1 + a„_i, an) G N"+^, so that the mentioned numerator is a/3[X[„_|_i]]. 
If one has a ^ N", so that .Sa[X[„]] = by definition, then a„ ^ 0, and all of the (first n + 1) 
components of f3 are nonzero; in this case the substitution X„ :— kills all terms of the numerator, 
so that s„[-'^[n+i]][-'^n 0] = 0. On the other hand if a G N" then also P G N", and in this 
case a/3[A'[„^]^]][X„ := 0] = a^[X[„]] = Xo ■ ■ ■ Xn-iCLs^^a[X[n]\] after simplification of the substituted 
numerator and denominator by Xq ■ ■ ■ Xn-i, one obtains SQ[Ar[„_|_i]][X„ := 0] = Sc[X[„]] as desired. □ 

Thus for fixed a gC, the families {sa_[X^n]])ne'N of symmetric polynomials are coherent with respect 
to the projections A[„_|_i] A[„] defined by the substitution X„ := 0. This property allows the following 
definition, which we already anticipated in our notation. 
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1.5 Diagram boundaries 



1.4.2. Definition. For a &C, the symmetric function .Sq is the unique element of A whose image in A[„] 
under the substitutions Xi := for all i > n is s„[X[„]] = [ q e N" ] ^^^fp^, for all n e N. 

The sot {sx \ X E V} forms a Z-basis of A, whose elements arc cahcd Schur functions. They arc the 
central subject of this paper, and we shall now introduce several notations to facilitate their study. Firstly 
we shall denote by ( ■ | • ) the scalar product on A for which the basis of Schur functions is orthonormal. 
Thus one has 

/ = X! ( / I ■SA )sA for any / G A. (15) 
xev 

The operation of multiplication by a fixed Schur function has an adjoint operation s* for this scalar 
product, i.e., which satisfies (s*(/) \ g) = ( / | s^g) for all f,g G A; in terms of this the skew Schur 
functions are defined by 

Sa/,^ = s*^{sx)- (16) 

They typically arise when one expresses the multiplication by a fixed symmetric function in the basis of 
Schur functions, as skew Schur functions are characterised by 

{sj\sx) = (f\sx/^) forallA,MGPand/e A. (17) 

For f = ha and f = Ca these scalar products are of particular interest, and are called Kostka numbers. 

1.4.3. Definition. For iJ,,X gV and a €C, we set Kx/p,^a = {ha\ s^/^ ) and K'^^^^ ^ — (^Ca \ s^/^ )• 

When jjL = (0), we abbreviate K^/^^a to -^A,a and K'^j^ ^ to K'^ ^. Then as a special case of (15) 
one has 

hoi=^ K\^aS\ and e„ = ^ K'^^a^x- (18) 
xev Xev 

We shall later give combinatorial descriptions of the Kostka numbers (corollary 2.6), from which it will 
become clear that they are non-negative (this also follows from representation theoretic considerations), 

and are related by K'^^^ ^ = -K^At/^t 

Computing scalar products ( / | s\) directly from the definition is usually quite hard. But if / = Sq, 
with a G C, then / can be considered as slightly generalised Schur function: either / = 0, or / = ±s\ for 
some X gP. Setting 

e{a, X) — (^Sa \ s\) for a G C and X eV, (19) 

one has e(a. A) G {--1,0, 1}, and given a there is at most one A with e(a. A) ^ 0; this symbol will be 
used as a signed variant of the Iverson symbol. Any expression in terms of such Sa can be converted to 
one in terms of Schur functions using 

Sa = ^£{o:,>^)sx, (20) 
xer 

which removes null terms, and replaces the remaining terms Sq by the appropriate ±s\ with A G "P. This 
process, which is the main source of alternating sums in this paper, will be called normalisation. 

1.5. Diagram boundaries. 

The partition A with e{a, A) 0, if any, is characterised by the condition (5„ + A = ((5„ + a)^, where n 
is so large that a G N", and e{a, A) is the sign of the permutation a G S„ such that Sn + X = a{Sn + a). 
The given condition gives rise to an equivalent one when n is increased, and the permutation involved 
does not change either, if each S„ is considered as a subgroup of Soo- Nonetheless, it is convenient to 
have a description that does not involve n at all. If we subtract n from each of the n entries of Sn + a: = 
{n—l — i + Oi) jg [„] , then coefficient i becomes ai — l — i, and therefore independent of n > i; moreover this 
transformation is compatible with the action of S„ by permutation of the coefficients. Now increasing n 
allows to associate with a a unique infinite sequence of numbers; note however that it will contain negative 
entries, and will no longer be finitely supported. We shall denote this sequence by a[] = {a[i])i^^, where 

a[i] = Ui — 1 — i for a G C and z G N. (21) 

Thus if e{a, A) is nonzero, it is the sign of the unique permutation a G Sqo, that transforms a[\ into A[]. 
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1.5 Diagram boundaries 



Instead of using sequences a.[\. one eould understand e(a, A) using an alternative way for the 
group Soo to act on finitely supported sequences of integers: defining a ■ a = (cx.^-i(j,-^ + i — c~^(*))j£isf) 
one may interpret e{a,X) as the sign of the unique permutation a G Soo, if any, such that a ■ a = X. 
However, considering sequences a[] has the advantage that they can be interpreted directly in a graphical 
manner, in particular when a is a partition. Each part Aj of a partition A corresponds to a row of its 
diagram [A] , and therefore to a unique vertical segment of the bottom right boundary of [A] , namely the 
segment that delimits row i of [A] to the right. In case A^ = 0, row i of [A] is empty, but we can still 
consider it as delimited to the right by the segment on the vertical axis that crosses row i. Now, whereas 
the sequence A records the horizontal coordinates of these vertical segments (taken from top to bottom), 
the sequence A[] records their "diagonal coordinates", where a segment has diagonal coordinate d if it 
connects points on diagonals d and d + 1. For instance, for A = (7, 5, 2, 2, 1) the shape of the boundary 
can be drawn as 



and the sequence of the diagonal coordinates of its vertical segments, taken from top to bottom, is 

(6, 3, — 1, — 2, — 4, — 6, — 7, . . .) = A[]. For general compositions a one could define a "diagram" in the 
same way as for partitions (although we did not), and while their boundaries are more ragged, reading 
off the diagonal coordinates of the vertical segments from top to bottom still gives the sequence a[]. 

Let us make more precise some terms that were used above. Both squares and points are formally 
elements of N x N, but we shall treat them as different types of objects. The square {i,j) has four 
corners, which are the points in the set + 1} x + 1}, so the square has the same coordinates as 
(and is formally speaking identified with) its top left corner; this is in keeping with our convention to 
index by i the interval from i to i + 1. Here we shall treat the diagonals Dj, (which were defined in (1)) as 
sets of points, which should not be too confusing since a square "lies on the same diagonal" as its top left 
corner. The boimdary associated to A € "P consists of a set of points, one on each diagonal Dd, which are 
connected by edges between the point on Dj. and the point on Dd+i for each d G Z. Using the fact that 
each diagonal is totally ordered under the coordinate-wise partial ordering of points, the point on Da can 
be characterised as the minimal element of the difference set i?d\[A], in other words the top-leftmost point 
on Dd that is not the top-left corner of any square of [A] . The edges of this boundary are also indexed by 
numbers G Z (their diagonal coordinate), as described above. Thus the vertical edge associated to the 
part Xi of A is the one between the points (i, Aj) and (i + 1, A,;), which has as diagonal coordinate the index 
Xi — {i + 1) = X[i] oi the diagonal containing the latter point. In the example above the vertical segments 
are(0,7)-(l,7), (l,5)-(2,5), (2,2)-(3,2), (3,2)-(4,2), (4,1)-(5,1), (5,0)-(6,0), (6,0)-(7,0), . . . 

We did not use the horizontal segments of the boundary associated to A G "P. However, since the 
sequence A[] is strictly decreasing, it is entirely determined by the set {X[i] | i G N} C Z, and the set 
of diagonal coordinates of the horizontal segments of the boundary forms the complement of that set 
in Z, which can be written as { — 1 — X*[j] | j G N }. It is often useful to think of A as represented by the 
characteristic function of the set { A[z] | i G N }, i.e., the function Z {0, 1} that takes the value 1 at d 
if d is the diagonal coordinate of a vertical segment of the boundary, and the value if d is the diagonal 
coordinate of a horizontal segment. In turn that function can be thought of as a doubly infinite sequence 
of bits (the edge sequence of A) , which describes the form of the boundary, traversed from bottom left to 
top right. The relations ^ X, fi ^ X and n -<r(fe) A can be easily understood in terms of transpositions 
of bits in edge sequences; notably /i -<r(fe) A means that the edge sequence of A is obtained from that of /i 
by transposing a single bit 1 with a bit that is k places to its right. This is graphically obvious if we 
draw the diagram of the k by superimposing the boundaries associated to the partitions involved: 



(7,5,2,2,1)^,(10) (7,6,6,3,3,2): 



the transposition involves the underlined bits in • • • 1111010110001001000 • • •. Moreover ht(A//x) is the 
sum of the bits that are being "jumped over" during the transposition, which is 4 in the example. 



9 



2 The Fieri and Murnaghan-Nakayama rules 

1.5.1. Proposition. For k > Q and /i, A G V, the relation ^ ^r{k) ^ holds if and only if there exist 
indices io, «i G Z such that { iJ,[i] | i € N — {io} } = { X[i] \ i gN — {ii} } and ij[io\ + k = X[ii]. In this 
case one has moreover ht(A//Lt) = io — ii. 

Proof By definition, /x -<r(fe) A means that k consecutive points on the boundary associated to /z must 

be moved one place down their diagonal to reach the boundary associated to A. This means that only 
the two segments that link a point of this subset of k points and a point of its complement change 
orientation, and their diagonal coordinates differ by fc. In terms of the set { /Lt[i] | z e N }, this means that 
one element /i[io] is replaced by /i[io] + k = X[ii], and since io and ii are the row numbers of the vertical 
boundary segments with diagonal coordinates n[io] and A[ii], respectively, one has ht(A//z) = io — ii- □ 

§2. The Pieri and Murnaghan-Nakayama rules. 

In this section we shall compute the product of Schur polynomials by respectively elementary, power-sum 
and complete symmetric functions, expressing the result in the basis of Schur functions. These lead to 
nice combinatorial formulae known as the Pieri and Murnaghan-Nakayama rules, in which one encounters 
the notions of vertical and horizontal strips, and of ribbons. 

Before starting our computations, we consider the validity of summations involving the values for 
all a G C„ at once. Although each of these is either zero or up to a sign equal to one of the finitely many 
Schur functions s\ with A G Fm this does not prove that only finitely many Sq, are nonzero, and such 
summations therefore well defined. The following proposition shows that this is nevertheless the case. 

2.1. Proposition. If Sa for some a G C„, then a G N", where N" is considered as subset of C. 

Proof If Sq- 7^ 0, then there exists A G T'n C N" with e{a,X) ^ 0. Now the sequence a[] is a 
permutation of A[], and therefore in particular it c;ontaiiis all values —1 — i for i > n as entries. 
If a had any part Qj > with i > n, then for the largest such i, the value —1 — i would not 
occur in the sequence a[], which would give a contradiction. Therefore one must have a G N". □ 
The starting point of our approach is a very simple formula describing the multiplication between a 
Schur function and an arbitrary symmetric function. 

2.2. Lemma. Let / G A, and write /[Xn] as a sum of monomials f[X^] = X^^^gc ^aX", then 

aeC 

Proof. First, we must assure that the summation is effectively finite (or stated more fancily: that 
it converges in A for the discrete topology), so that the right hand side is well defined. For this it 
suffices to observe that in order to have ^ one must have \a\ < degf, and that in order to have 
Sa+i3 7^ 0, one must have a + f3 £ ]srl"l+l/3| by proposition 2.1, and therefore certainly a G N'^'+I'^I, 
which leaves only finitely many possibilities for a. With this point settled, it will suffice to prove that 
the image /[X[„]]s^[Af[„]] = X^„gisfn CaSa+/3[X[„]] of the identity in A[„] holds for all n G N. We may 
assume G N"; it is then obvious that /[A[„]] A''"+'^ = X^aeN" c^X^"^"^^ . To this equation we apply 
every cr G S„, acting by permutation of the indeterminates, and take the alternating sum by £(17) of 
the resulting equations. Since /[-'ffn]] is invariant under such permutations, we obtain the equation 
f[X[n]]as„+i3[X[n]] = SasN" Caas„+a+p[X[n]] in A[„]. Now division by A„ gives the desired identity. □ 

This lemma can be paraphrased as follows: when multiplying symmetric functions by Schur functions, 
one may behave as if X^Sjs were equal to Sa+/3 (although of course it is not). This lemma will be our 
main tool for this paper. It allows formulae for products of symmetric functions by Schur functions to 
be obtained very easily, but the resulting expression on the right hand side is not normalised, even 
when /3 G "P. After normalisation one gets an alternating summation with possibly many internal 
cancellations; the remaining task is to perform such cancellations explicitly, and describe the surviving 
terms in a combinatorial way. This often involves a certain amount of arbitrariness: in general an 
alternating sum neither uniquely determines the set of terms that will survive the cancellation, nor the 
way in which the other terms are paired up so as to cancel (in so far as they are not already null 
initially). In our applications however, a set of candidates for survival that are already normalised will 
usually present itself, and the arbitrariness is limited to the way in which the remaining terms cancel. 

Here is a concrete example of a computation using lemma 2.2, namely that of the decomposition of the 
product TO(3^i)S(2,i) into Schur functions. We traverse the Soo-orbit of (3, 1) in decreasing lexicographic 
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2 The Fieri and Murnaghan-Nakayama rules 

order ((3, 1), (3, 0, 1), (3, 0, 0, 1), ... , (1, 3), (1, 0, 3), ... , (0, 3,1),..., (0, 1,3),..., (0, 0, 3, 1), . . . ), but 
omit those elements a for which ((2,1) +Q;)[] has repeated entries. After normahsation there are no 
cancellations, but in two cases a pair of terms is combined into one: 

"^(3,1)5(2,1) = ■S(2,l)+c« 

c«eSoo-(3,i) 

= S(5,2) + ^(5,1,1) + 5(3,1,3) + 5(3,1,0,0,3) + ^(2,4,1) 

+ 5(2,2,0,3) + 5(2,2,0,0,3) + 5(2,1,3,1) + 5(2,1,1,0,0,3) + 5(2,1,0,0,3,1) 
= 5(5,2) + 5(5,1,1) - 5(3,2,2) + 5(3,1,1,1,1) - 5(3,3,1) - 2S(2,2,2,1) + 5(2,2,1,1,1) + 2S(2,i,i,i,i,i) . 

In the remainder of this section we shall apply the lemma three times, in each case for a symmetric 
function / all of whose nonzero coefficients Ca are equal to 1 . The first and easiest application will be for 
f = Cd, where the monomials X" with nonzero coefficients are those for binary compositions a of d. 

2.3. Proposition. For c? G N and fi & Vn one has e^s^ = X^Aiep ^^[m ^ '^l ^a- 

Froof. Applying lemma 2.2 gives e^Sf^ = X^^gc'^' s^+a. Now fx -\- a can only fail to be a partition 
if for some index i one has /ij = Hi+i while (aj,aj_|_i) = (0,1); in that case one has s^+a = 
since (ytt + a) [i] = {^ + a)[i + l]. The remaining terms are already normalised, and the corresponding 
values of /i + a run through the set {AG Vn+d I M ^ A } by the definition (1.1.1) of the relation □ 

As a first consequence of this proposition one finds for = (0) that s^d) = e^, since S(o) = 1 and 
{XeVd\0^ X} = Vdn C'=' = {l^'^)}; this fact can also be easily deduced directly from the definition 
of Schur functions. Another instance of the proposition gives eis^ = X^AeP +i I'^ ^ ^'^l ^a for M € Vn- 

In this first application of lemma 2.2, it sufficed to discard the null terms in the resulting summation. 
Our second application, which will be for / =pfc, is slightly more complicated: it still suffices to discard 
the null terms (there are no cancellations of pairs of terms), but the remaining terms are not all normalised, 
and in normalising them some signs do remain (for k>2). This will in fact be the only case in our paper 
where we obtain a result that is not a positive sum of Schur functions. The monomials occurring in pk 
are just the powers Xf, which are the; monomials X" where a is a permutation of the composition (fc), 
in other words a composition of k with just one nonzero part at = k for some i G N. 

2.4. Proposition. For k > and fx €Vn one basphS^ = J2xev„+k^l^ ^'•(fc) A] {—l)^^^-^/'^''sx- 

Proof. Applying lemma 2.2 gives p^s^ = X^ieN 5;^+a(i) where a^'^ = ([* = .? ] fc)jeN is the composition 
of k with a unique nonzero part at position i. If for any j < i one has ij[i] + k = then the 

corresponding term s^_|_„(i) vanishes, because the sequence {iJ, + a^^^)[] has equal entries at positions 
i and j. We discard such terms, and henceforth suppose that no such j exists. Then the sequence 
(/Lt + q;'-'^)[] can be transformed into a strictly decreasing sequence by successively transposing the entry 
(/i + Q:('^)[i] = iJ,[i] + k with its preceding entry some number of times (possibly zero), and the resulting 
sequence is of the form A[] for some A G Vn+k- If j < i is the final position of the term + k, so that 
IJ,[i] + k = A[j], then the sign of the permutation that has been apphed is (— 1)*"-' = e(/i + a^^\X). 
Now by proposition 1.5.1 one has jj, -<r(fc) A and ht(A//i) = i — j, and the proposition follows. □ 

One obtains for instance, for fi — (0), an expression of pk as an alternating sum of "hook shape" 
Schur functions: pk = X^jgj/jj (— l)*s,/i) where 1^^*^ = {k — i,l^^^) is the partition whose nonzero parts 
consist of one part k — i followed by i parts 1; a concrete example is p4 = S(4) — S(3,i) + S(2,i,i) — S(i,i,i,i)- 

Using (15), one sees that this proposition is equivalent to the statement that for A G Vn, k > 0, and 
/ G A of degree n — k, one has 

{Pkf\sx)= Yl [M^r(fe)A](-l)^*WM)(/|s^). (22) 

Since it is known that a scalar product of the form {pki ■ ■ - Ph \ s\ ) can be interpreted as the value of 
the symmetric group character parametrised by A on the class of permutations with cycles of lengths 
ki, . . . ,ki, this gives a recurrence relation for symmetric group characters. For instance, the value of 
the Si7-character parametrised by A = (7, 5, 2, 2, 1) on a permutation of cycle type (4, 3, 3, 2, 2, 1, 1, 1) is 
(papIpIpi I sa), which can be computed either as -(p^p^pf | S(4,4,2,2,i) ) + (P3P2Pi | 5(7,5,i) ) using the 
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recurrence relation for k = 4, or as {pipsP^Pi \ •8(7,2.2,2.1) ) ^ {PiP3P2Pi \ S(7,5,2) ) using the recurrence 
relation for = 3 (one could also use fc = 2 or fc = 1, but that gives expressions with 3 and 4 terms, 
respectively); if the character tables for S13 or S14 have been previously determined, this can be evaluated 
to give the value — 3H — 2 = —4 — 1 = —5. This recurrence relation is known as the Murnaghan-Nakayama 
rule, and appears in [Mur] and [Nak]. That name is also given to the combinatorial expression for 
(Pki ■ ■ - Pki \ s\) in terms of chains of partitions related by '-<r(fc)' for varying fc, that can be obtained 
by expanding the recurrence relation recursively; however, in that form, which can already be found in 
[LiRi], the rule is rather less practical for computations, due to the large number of terms it often gives 
(which in addition depends heavily on the order in which the factors pk^ are extracted). 

Our third application of lemma 2.2 will be for / = hd] here the coefficients Cq, are nonzero for all 
compositions a G Q. Thus the summation obtained from the lemma gives many more terms than in 
the previous applications, and we must perform some real cancellations. Yet, surprisingly, the resulting 
linear combination of Schur functions will be quite similar to the one obtained in proposition 2.3, notably 
all nonzero coefficients are equal to 1. In fact the only difference is the replacement of vertical strips by 
horizontal strips. That proposition and the following one are known as the Fieri rules (or Fieri formulae). 

2.5. Proposition. For € N and ii&Vn one has hdS^ = '^xep„+d I •^l 

Proof. Application of lemma 2.2 gives has^ = J^aeCd *M+a- ^^^^ summation we must cancel all the 

terms with /i /i + a, in other words those terms with (/i + a)i+i > pLi for some index i G N. That 
inequality is equivalent to (/i + a)[i + 1] > /K[i], which means that (/x + a)[z + 1] lies outside the "safe 
interval" /j+i = { j e Z | + 1] < j < ii[i] }. Setting in addition to this /q = { j G Z | /i[0] < j }, the 
intervals li form a partition of Z, so a determines a unique map /: N ^ N such that (/i + a)[i\ G I f{i) 
for all i. Moreover, one clearly has f{i) < i for all i, so if / is injective, it must be the identity, 
which is equivalent to jj, jj, + a. We must then cancel the terms for those a for which / is not 
injective. Indeed these terms can be cancelled, since if /(«o) = / (*i) for io =^ ii, then the transposition 
of entries io and ii of the sequence (/x-|-a)[] gives a sequence of the form (/x + q;')[], with a' G Cd 
and s^+a + s^+a' = 0; either these are two distinct terms that cancel each other, or if a = a' they 
are twice the same term, which is then null already. It remains to show that cancelling terms can be 
paired up effectively, in other words that a concrete pair of indices (io, ii) can be chosen for each a € Cd 
with fj, ^ 1^ + a, in such a way that for the resulting a' the same pair {io,ii) will be chosen. This is 
easy: since the function / is the same for any a' that would cancel with a, any choice that depends 
only on / will do; for instance one can take the minimal possible pair {io,ii) in lexicographic order. □ 
Similarly to what we saw for proposition 2.3, a first consequence of proposition 2.5 is that S(^d) = hd, 
which is not quite as easy to obtain directly from definition 1.4.2 as Si(d} = is. As an illustration we 
choose a rather modest example, since the number of terms in the summation tends to grow rather rapidly: 

one has /l4S(4,2,2,l) = S(9,2,2,l)+S(8,3,2,l)+S(8,2,2,2)+S(8,2,2,2,l)+S(7,4,2,l)+S(7,3,2,2)+S(7,3,2,l,l)+S(7,2,2,2,l) + 

5(6,4,2,2) + 5(6.4,2,1,1) + 5(6.3.2,2.1) + 5(5.4,2.2.1)- Although a dozcu tcrms remain, the number of nonzero 
terms produced by application of lemma 2.2 before normalisation is considerably larger, namely 178. 
To illustrate the cancellation, consider the cancelled term 5(4 2.4,2.2) that arises for a = (0,0,2,1,2). 
Putting /i = (4,2,2,1), and comparing + a)[] = (4,2,4,2,2j[] = (3,0,1,-2,-3,-6,-7...) with 
/u[] = (3, 0, —1, —3, —5, —6, —7 . . .), we see that the entries of the former sequence that lie outside their 
respective save intervals are (/x + a)[2] = 1 ^ I2 = {—1} and (/x + a)[4] = —3 ^ = {—5,-4}. Since in 
fact 1 G /i = {0, 1, 2} and -3 G /a = {-3, -2}, the values f{i) ^ i are /(2) = 1 and /(4) = 3. Therefore 
a term s^+a' cancelling s^^+a can be obtained either by transposing the entries and 1 at indices 1 and 2 
of (/i + a)[] or by transposing the entries —2 and —3 at indices 3 and 4. The result can be written 
as (4, 3, 3, 2, 2)[] respectively as (4, 2, 4, 1, 3)[], and indeed one has 5(4 3.3 2.2) = 5(4.2,4.1,3) = ^5(4 2,4.2,2)1 
while the occurring compositions can be written as /U-t- a' for a' = (0, 1, 1, 1, 2), (0, 0, 2, 0, 3), respectively, 
so that both 5(4.3,3.2,2) and 5(4.2,4.1,3) occur in the summation. The rule of choosing the lexicographically 
first pair of indices would in fact make the term 5(4 2,4.2,2) cancel against S(4 3 3 2.2)- 

As with proposition 2.4, propositions 2.3 and 2.5 lead to recurrence relations for certain scalar 
products. In particular we obtain such relations for the Kostka numbers: 

'f^A//x,a = ( 5Mea I SA ) = ^ [ ^ A*' ] ^A//x',(ai,a2,...) ' (^3) 
Kx/^i.,a = {Sij.ha\ Sx) = ^ [ fj, ^ fl' ] Kx/^^' ,(ai,a2,...), (24) 
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for all X, IJ, G V and a G C. One can rcx'ursively apply these recurrence relations: using the basic case 
^'x/x a ~ ^'x/x a = [ ^ = (0) ] ^^'^ Comparing with definition 1.2.2, one finds the promised combinatorial 
description of Kostka numbers. 

2.6. Corollary. For X,^€:V and a gC, the Kostka number Kx/^^a equals the number # SST(A/ a) of 

semistandard tableaux of shape X/ ^ and weight a, and K'x/^^ a ^^'^ number of transposed semistandard 
tableaux of shape X/fi and weight a. As a consequence the two are related by K'xif^a ~ ^a'/m' "- '-' 

This description shows that one has #SST(A//z, a) = # SST(A//U, a(a)) for any a € Soo, and it 
turns equation (18) into a combinatorial expression of ha and e„ in terms of Schur functions. In fact, 

we may consider ha and as generating polynomials in A of semistandard (respectively transposed 
semistandard) Young tableaux of weight a by shape, in the basis of Schur functions; 

'^«= E E [wt(r)=a]sA, (25) 

AePld TeSST(A) 

ea= E [wt(T) = a]sA. (26) 

Ae-P|<,l TeSST(A') 

Besides the combinatorial description of corollary 2.6, one also has the following expressions for Kx/ij,,0 
and K'^^^ ^ that can be obtained directly from lemma 2.2, using (10) and (9): 

Kx/^,,0 = Yl #^«,/3< W \sx)= = /5] + row(M), A), (27) 

aeC MeM 

^A/M,/3 = E#^a!/3(^M+a I sa)= ^ [ col(M) = /3 ] £(m + row(M), A). (28) 

These expressions produce the Kostka numbers by an alternating enumeration of matrices rather than 
by a direct enumeration of tableaux, and are therefore rather inefficient as a means of computing those 
numbers. However it will prove useful to have, as a complement to corollary 2.6, these descriptions that 
do not involve semistandard tableaux, or horizontal or vertical strips. 

§3. The Cauchy, Jacobi-Trudi and Von Nagelsbach-Kostka identities. 

We have expressed products of elementary, of power sum, and of complete symmetric hmctions as Z-linear 
combinations of Schur functions. However we have not yet expressed Schur functions themselves in terms 
of anything else, not even in terms of monomials, other than via the polynomial division used in their 
definition. In this section we shall provide such expressions of (skew) Schur functions, in terms of minimal 
symmetric functions (which implies an expression in terms of monomials), in terms of products ha of 
complete symmetric functions, and in terms of products e„ of elementary symmetric functions. The latter 
two expressions will eventually be obtained via another application of lemma 2.2, but the expression in 
terms of minimal symmetric functions will be obtained in an indirect manner, by establishing that the 
dual basis of { mx \ X &V} with respect to the scalar product on A is in fact the set {hx \ X gV} (with 
the obvious correspondence between their elements, in other words ^ mx | /i^ ) = [ A = /i] ). Another way 
of formulating that property is that scalar products ( / | hx) can be interpreted as the coefficient of mx 
in the expression of / as linear combination of minimal symmetric functions, and hence that {f\ha) 
can be interpreted as the coefficient of X" in the expansion of /[ATn] into monomials. This will imply in 
particular that Kx/^,a can be interpreted as the coefficient of X" in S;^/^[XN]. 

Let us first show that { ft-A | A G P } is a Z-basis of A in the first place. Since its elements are precisely 
the monomials that can be formed from the set { hi \ i > }, this amounts to showing that those hi are 
algebraically independent generators of A, which can then be viewed as a polynomial ring Z[hi,h2, ■ ■ ■]■ 
One has /i^ = Ylxev ^x.fiSx for ^ G Vn, while { sa | A G "P } is known to be a Z-basis of A, so we can 
obtain our goal by showing that the matrix {Kx.fi)x.fie'P„ of Kostka numbers is invertible for every n. 
That will follow from the fact that this matrix is unitriangular for the dominance ordering on Vn- 

3.1. Proposition. For X G Vn a-nd a G C„, Kx^a implies < X; moreover Kx.a = 1 if ci^ = A. 

Proof. Since Kx,a = Kx^a+ we may assume a = G "P. We use the description of Kx^a in corollary 2.6. 
For any Young tableau (A^'^)jgN G SST(A), the diagram [A^*^^] is contained in the topmost k rows of [A], 
for all A; G N. The existence of (A(*)),eN G SST(A, a) therefore implies 'Zielk] = l^^''^! ^ E,e[fe] 
If a = A, we see that [A^'')] must coincide with the topmost k rows of [A], whence #SST(A, A) = 1. □ 
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A combinatorial proof of this proposition for general q G C, without using K\,c, = K\.a+i can also 
be given, but is slightly more awkward to formulate (one basically argues that the number of squares 
contributed by any column to the first k rows of [A] is at least as large as the contribution, in any 
T G SST(A, a), of the k largest parts of a to that column). But we shall only use the case a E P. 

The proposition implies that the matrix {Kx,^)x,fi£'p„ is upper unitriangular, and hence invertible, if 
Vn is ordered in such a way that A comes before n whenever X> fi, for instance by decreasing lexicographic 
order. Since, by corollary 2.6, {K'^ fJ\,ii£'P„ is obtained from that matrix by a permutation of its rows 
(the permutation being given by the map A A*: — > Pn), we may conclude from = "^xeVn 
that A is also a polynomial ring Z[ei, 62, 63 . . .], a fact that is sometimes called the fundamental theorem 
of symmetric functions, and that is usually proved without using Schur functions. 

To prove that the basis {h\ \ \ eV} is dual to { m\ | A G P }, one may establish, for / running 
through some basis of A, that / — Ylx^v ( / | ''■A or equivalently that /[ATn] = Ylaec ( / | )X°^. 
It is not practical to use the basis of Schur functions for this, since in that case those expressions are 
precisely what we would like to conclude from duality. However for / = e/3 or for f = we already 
know their expressions in terms of monomials, which are given in equations (9) and (10), while the 
scalar products that should coincide with the coefRcients in those equations are easily computed. Indeed 
rising equations (18) to develop these elements in the orthonormal basis of Schur functions one finds 
(cfj \ ha) — J2xep ^'x i3^>^,a ''^^'^ {^fj \ ha) — J2xeP K\,l3^\,a- Thus we need to show the following. 

3.2. Proposition. For every a,0 gC one has 

#Ma,0 = Kx,aKx,0 (29) 
xev 

and 

= E ^A.«^A,/3- (30) 

Proof. This is the one place where we import a result from elsewhere. The proposition expresses 
the enumerative consequences of the bijections between integer matrices and pairs of semistandard 
tableaux of equal shape defined in [Knu], the first of which is known as the RSK-correspondence. □ 

3.3. Corollary. The Z-bascs {hx \ \ E V} and { mx | A G P } of A arc dual to each other with respect 
to ( • I • ); for every / G A one has f = "^xev ( ^a | / )nix, and consequently /[An] = '^aec ( I / )^"- 

Proof. Since we have shown that {hx | A G "P } and { eA | A G P } are Z-bases of A, either of the special 
cases f = hfj or f = Bfj that are given by proposition 3.2 suffice to prove the general case / G A. □ 

Applying this corollary to / = Sx/^ gives the expressions that we were after, of (skew) Schur functions 
in terms of minimal symmetric function and in terms of monomials: 

^a/a( = E -'^a/m,!."^.^, (31) 

SA/^[^N] = E^AMaX"= E (32) 

c«ec TessT(A/M) 

For ji = (0), the final expression describes sa[An] as the generating series in Z[[An]] of semistandard 
Young tableaux of shape A by weight. This is often taken as the definition of Schur functions (for instance 
in [Stan]), but in our setting it is a nontrivial statement that provides an alternative combinatorial 
interpretation of semistandard Young tableaux T G SST(A, a): rather than corresponding to constituent 
Schur functions sx of ha, they now correspond to constituent monomials X" of sa[-^n]- One may ask 
whether this result can be proved in a bijective manner. The way in which we obtained it does not 
qualify as a bijective proof: although the main ingredients (lemma 2.2 and proposition 3.2) are based on 
explicit correspondences, we did need some (linear) algebraic reasoning. The most direct possible bijective 
proof would be to give, for / = X^Q,gNn X!TessT(A a) ^ 2[^[n]]i ^ correspondence between terms that 
establishes /A„ = as^+x[X[n\\- That would require a fairly complicated correspondence, but there is a 
simpler possibility that nevertheless is more or less equivalent: taking / = X^7'gsST(A) 

and /3 = (0) 

in lemma 2.2, its right hand side X^T6SST(a) ■^wt(T) should reduce by cancellations to sx, and one may seek 
to describe such a cancellation process explicitly. For instance for A = (2, 1) the right hand side, restricted 
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to wt(T) e N^"* thanks to proposition 2.1, gives S(2a) +S(2,o,i) + •''■(1.2) +2s(i,i,i) + 0,2) + •''■(0,2,1) +'S(o, 1,2); 
here the terms S(2,o,i)j -^(1,2)7 ^^nd S(o^i,2) are null, and the remaining terms except the initial one can be 
cancelled using = ^ 1) + S(i,o,2) + ■5(o.2,i)- We shall see below, when we consider products 

of Schur functions, that an explicit and general description of such a cancellation can indeed be given. 

Now that we have identified the generating series of semistandard Young tableaux of a given shape 
by weight, we may turn the equations of proposition 3.2 into generating series identities by multiplying 
them by X°'Y^ and summing over a, /? e C; one obtains 

n Y^xY ^ ^ «A[^N]sA[i^N] (33) 

and 

[] (1 + XiY^) = sx[X^W [YM- (34) 

The first of these is called the Cauchy identity (although it is difiicult to justify the attribution; for 

details sec [Stan, p. 397]), and the second the dual Cauchy identity. From (12) and (33) one deduces 
(after combining terms in the left hand side of the former with a common value (3'^ = A) that 

hx [X^]mx [Fn] = ^ sa [X^]sx [Y^] (35) 

AG-p AGP 

in Z[[Xn, In]], which is an alternative formulation of the duality of corollary 3.3. Similarly (11) and (34) 
give 

Y ex[X^]mx[Y^] = ^ sa'[^n]sa[1^n]. (36) 

AG-p AGP 

This equation does not express a duality like the previous one, but one can relate the two equations using 
the Z-linear involution w: A ^ A that maps Sx 1— > Sa» for all X gV. By (18), the fact that K'^ ^ = Kx\a 
implies uj{ha) = for all a S C, so w coincides with the ring morphism A ^ A that sends hi for 
alH > (which exists since A = Z[/ii, /12, • • •])• Then (36) can be obtained from (35) by applying, to the 
subring of Z[[Xn, ^]] that is the the image of A ® A via f ® g ^ /[X]sr]g([lisr], the ring automorphism 
corresponding to the automorphism ® 1 of A Cg) A. (Actually that is not quite true: one needs to take 
the closure of the subring in the topology of power series rings, and extend the automorphism to that 
closure by continuity.) 

We now proceed to express skew Schur functions as Z-linear combinations of elements h^. Since 
duality of bases is a symmetric notion, the coefficient of hi^ in the expression of any / G A in the basis 
{hx \X€'P} can be computed as ( niv | / )• Applying this for / = Sx/^,, we see that its coefficient of h^ 
is ( rUi, I sxj ^ ) — { s^nii, | sx)- This number can be determined by applying lemma 2.2 for irii, (a case 
that we did not yet exploit before), which gives as result the expression X^aGct'^"'' ~ i^j £(/U + a, A). This 
shows that 

*A//i = X! X^t*^^ = z^]£(/i + Q!,A)/i^ = ^e(/i + a, A)/Iq;, (37) 
i/ePaec aec 

since hi, — ha when a+ = u. This achieves the expression of skew Schur functions in terms of complete 
symmetric functions, but we shall proceed to make the expression more transparent. The process will be 
somewhat different from what we have seen before, as we shall not actually cancel any pairs of terms, 
nor even group together terms for values of ha that are equal. 

The nonzero terms of the final summation of (37) can be found by permuting the terms of A[] in all 
possible ways such that each term of the resulting sequence o-(A[]) is at least as large as the corresponding 
termof^[] (in formula, A[CT^^(i)] > /x[i] for alH e N), which condition we shall abbreviate as ct(A[]) > A^[]; 
then a = cr(A[]) — gives a contribution e{a)ha- From the proof of proposition 2.1 we see that if 
A £ N', then one needs to consider only permutations a G Si. For instance for X/ H = (5, 4, 2)/(3, 1), 
only permutations of the 3 initial terms of A[] = (4, 2, —1, —4, . . .) need to be considered, and only the 
permutations giving (4,2,-1), (2,4,-1), (4,-1,2), and (2,-1,4) satisfy £7(A[]) > n[] = (2.-1.-3,...); 
for this case (37) therefore gives S(5,4,2)/(3,i) = /i(2,3,2) - /i(o,5,2) - '^(2,0,5) + ^1(0,0,7) = /i(3,2,2) - 2/1(5,2) + h^. 
We can now write (37) as 

«A/^ = Y [^('^[D ^ ] ^('^) '*<^(A[])-/^[]- (38) 
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With the convention that hi = when i < (which wiU take care of the factor [ (t(A[]) > ] ), we can 
write this expression as the determinant of a matrix whose entries are complete symmetric functions: 



sa/p = det(/iA[i]-^[j]) when A e N'. 

This identity is called the Jacobi-Trudi identity. For the example above, it gives 



(39) 



S(5,4,2)/(3,l) 



h2 /i5 hr 
ho hs /i5 
ho h2 



h{3,2,2) - 2/l(5,2) + hr. 



To obtain an expression of skew Schnr hmctions in terms of elementary symmetric functions, we may 
apply the automorphism u) to equation (37), giving 



aeC 

from which one deduces as above, again with the convention that ej = when i < 0, that 



det(eA[, 



*l-M[i]/i,j6[(] 



when A G N', 



or equivalently 



sx/^l = det(e;^t[i]_^t[j]) . ..gj^j when Aq < r. 



(40) 



(41) 



(42) 



This final equation is sometimes referred to as the dual Jacobi-Trudi identity, but we prefer to call it 
the Von Nagelsbach-Kostka identity (following Littlewood, see [Litt, p. 87], but restoring the spelling 
that had been lost in translation). We note that Kostka proved the identity in [Kos] by an ingenious 
manipulation of determinants representing alternants, see [Muir, p. 155]; see also [Stan, p. 397]. 



§4. The Gessel-Viennot correspondences. 

Let us compare equation (27), which expresses the Kostka number i^A//*,/? as an alternating sum over 
integral matrices with column sums given by (3, with the description of Kx/^^fj as counting semistandard 
tableaux of shape A//i and weight /?. The integral encodings M of such tableaux (given in definition 1.2.3) 
are also matrices with col(M) = /3, and since moreover row(M) = A — /U, these matrices contribute +1 to 
the alternating sum in (27). Therefore these integral encodings give a subset of its terms that completely 
accounts for the value of the alternating sum; apparently all remaining terms cancel out. We can combine 
the observed identity for all [3 into a generating series identity 

/3gC M^M MeTabl(A/^) 

where the second member is obtained from (27), and the third member from corollary 2.6 by taking 
integral encodings. In this section wc shall discuss the reduction of the second member to the third 
member by cancellations, and a similar question for an identity derived from equation (28). 

We know from (32) that the first and last members of (43) equal Sa/m["^n], and the cancellation that 
we consider is often associated with the Jacobi-Trudi identity. Indeed, if one takes equation (10), slightly 
rewritten as /laf-'^N] = Sm6Ai[^°''^(-^) ~ "^1 X™''^', and substitutes it into (37), then one obtains 

Sx/,[Xn]= E e(M + row(M),A)X-'W, (44) 

whose second member matches that of (43). We note however that we did not need (32) or (37) to 
obtain (43). 

In fact, the way we obtained (43) suggests a way to understand how to cancel terms in it. Its second 
member was originally obtained from an application of lemma 2.2 for multiplication by hp, while its 
third member was obtained by repeated application of proposition 2.5 for individual multiplications by 
factors hjs^ , each of which uses lemma 2.2 followed by a pruning of the summation. It suffices to postpone 
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such intermediate pruning. Stated differently, for each term of the second member that ncxxis to be 
cancelled, one needs to find the factor h/}. responsible for this fact, and cancel the term correspondingly. 

Let M e. ^A with col(M) = /3. Each column Mj represents the exponent of a term in the factor hp. 
of hp, so that the entire matrix represents the monomial X™'^'^^^ in the expansion of hp. If we have 
M G Tabl(A//z), which means that M is the integral encoding of some semistandard tableau T of 
shape X/ fi, then the sequence of partitions {X'^^^JieN that defines T can be found by starting with /x and 
successively adding the compositions Mj for increasing values of j: A^'-* = /x + X]j<j ^j- Therefore the 
condition that M £ Tabl(A/y^) amounts to the requirement that /i + X^j<; Mj ^ fi + X^j<; Mj holds for 
all I G N, and that moreover one has /i + row(M) — A. 

Now suppose that M ^ Tabl(A//i), so that M defines a term of the second member of (43) that has 
to be cancelled. Should only the final part /i + row(A'/) = A of the condition M E Tabl(A//i) fail, then 
H + row(M) is a partition distinct from A (recall that a a' implies that a, a' G V), which means that 
e(/x + row(M), A) = 0, and the term defined by M is null (this case only arises because (43) just deals 
with the coefficient ( /i/j | s^/m ) = ( s^hp | sx ) rather than with all of Sf^hp). 

Putting that case aside, there must be an index / for which fi + 5]]j<; ^j ^ + Y^jKi ^^'^ 
choose the minimal such I; this means that in the computation where .s^ is successively multiplied by the 
factors hp., the term determined by M is cancelled after the multiplication by hp^. At that point the 
term is Sa with a = /x + J2j<i ^^'^ it is cancelled against a term Sa' defined similarly, but possibly 
with a different composition instead of the final term M/ of the summation (as usual we do not exclude 
the possibility a' = a, if already = 0). From the proof of proposition 2.5 we recall that a' is such that 
the sequence a'[] is obtained from a[] by interchanging its entries at appropriately chosen distinct indices 
io and ii. Those indices were chosen in a manner that ensures that the value that should replace M/ in 
the expression for a' is a composition, i.e., it has no negative coefficients. That composition is in fact 
obtained from Mj* by a transfer between the entries Mi^^i and Mi^^^i, by which we mean the replacement 
of those entries by some pair of natural numbers with the same sum. 

We have considered M truncated to its columns Mj with j < I, and found another such partial matrix 
such that their contributions to Sfj_hpg ■ ■ ■ hp^ cancel. It is clear algebraically that the total contribution to 
s^hp from matrices that extend the first partial matrix cancels the total contribution from matrices that 
extend the other partial matrix. However, to describe the cancellation in (43) combinatorially, we have 
to match up the term for M with a specific term cancelling it, taking into account also the columns Mj 
with j > I. The obvious way to do this is to exchange, in addition to the transfer between Mj^^; and Mi-^^i, 
the entire remainders of rows io and ii to the right of those entries; in this way it is guaranteed that the 
values of (/x + row(M))[io] and (/x + row(M))[ii] are interchanged between a pair of matching matrices. 

Here is a simple example to illustrate the construction of cancelling matrices. Let A//(x = (7, 5, 4)/(l) 

and 

12 3' 
M= 12 12 

3 10, 

so that /? — col(M) — (6,4,5). Then one already has /x + X]j</ M + ^j<i ^'^^ ^ = 0, the concrete 

values being /x = (1) (2, 2, 3) = ^ + Mg. Setting a = /x + Mg one has a[] = (1, 0, 0, —4, . . .), and the 
indices {io,ii) can be chosen in various ways. For instance for {io,ii) = (0,1) one finds the sequence 
a'[] = (0, 1, 0, —4, . . .) for a' = (1, 3, 3), which can be realised by replacing Mg by (0, 3, 3). But one can 
also take (io, ii) = (1, 2), which permutes two identical entries of a[] and therefore gives a" = a, and an 
unchanged cohunn Afg (and indeed one has S(2,2,3) = 0), or (io,ii) = (0,2), which realises a'" = (1,2,4) 
with a"'[] = (0,0,1.-4....) by replacing Mg by (0,2,4). Performing the indicated changes to M, we 
find that the contribution to s^^hp from M could be cancelled by either one of the contributions from the 
matrices 

/O 1 2\ /I 2 3\ /O 1 0' 

M' = 3 2 3 , M" = 2 1 , or M'" =212 
\3 10/ \3 1 2/ \4 2 3^ 

Indeed one has S(7,5,4) = ^ -9(4,8.4) = —5(7.3,6) = — S(2,5,9) (the subscripts were respectively computed as 
/X + row(M), fj, + row(M'), + row(M"), and /x + row(M"')), which shows that the contribution of M 
to the second member of (43) could cancel against that of M', of M", or of M'"; the rule of choosing the 
minimal possible {io, ii) would in fact select M'. Apart from the operations involved in the cancellation, 
the example illustrates the fact that when we cancel Sa against Sa' with a ^ a', one might still have 
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Sa = 0; it also shows that cancelling a null term Sa in s^hfj^ ■ ■ ■ /1/3, "against itself" might lead to the 
cancellation of two distinct terms in 

There is a beautiful graphical way of visualising this cancellation, and the terms that survive it, 

due to Gesscl and Vicnnot [GeVi]. Set a*^^-' — 1^ + Mt for j G N, so that M identifies the term 



s^U) in the development of s^^hp^ • ■ • h 



The idea is to trace for each fixed index i the evolution of 



the number a^^'>[i], as j increases, by means of a lattice path: a path connecting points of using 
unit steps in two orthogonal directions, in the current case down and to the right. One has a separate 
path for each i G N; path i passes through all of the points (j, q(-'^[«]) for j G N. These points are 
connected as follows: after arriving at (j, a*^-'-' [i]) , the path takes Mij horizontal steps (so the second 
coordinate becomes a(^^[i] + Mij = a^^~^^^[i\), followed by a vertical step to arrive at [j + l,a^^~^'^^[i]). 
If I > 0, this portion of path i remains disjoint from path « — 1 if and only if a'^-''^"'^) [i] < a^^^fi — 1], in 
other words if the addition of Mij to a^^^ [i] leaves it inside what we called the safe interval /j in the 
proof of proposition 2.5. Thus all paths will remain disjoint when taking into account column Mj if and 
only if a(j) ^ a'^) + Mj = a^^+i); they will remain disjoint forever if and only if M € Tabl(A'//i) for 
some X' gV. 

For any M, path i starts at (0, and eventually runs along the vertical line {fj, + row(M))[i]) . 
The term for M in the second member of (43) is nonzero if and only if these vertical lines, for i S N, 
form a permutation of the lines (*, A[i])jgN5 whose sign then gives the coefficient of the term. If moreover 
/x+row(M) is a partition, which must necessarily be A given the previous condition, then this permutation 
will be the identity, so that path i runs from the point (0,/Lt[i]) to the line (*, X[i]). This is in particular 
(but not exclusively) the case whenever the paths are disjoint, so that M € Tabl(A//i). 

As an example of a term that survives the cancellation, and of the family of disjoint paths that corre- 
sponds to it, we recall from §1 the integral encoding M G Tabl(A//z) of the tableau T G SST(A//tx) of (2), 
where X/ jjL = (9, 8, 5, 5, 3)/(4, 1). To facilitate recognition of the compositions (in fact partitions) a^^\ 
we have written the coefficients of /x along the left border of M. The paths, taken from right to left, 
correspond to the rows of the matrix from top to bottom, and to the rows of T in the same order. 
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On the other hand, for the terms that must be cancelled, we have that whenever two paths of the 
associated family touch, then a family of paths with opposite contribution can be formed by exchanging 
these two paths after the point of contact; if the term is nonzero, this exchange will change the sign of the 
permutation defined by the family. This exchange corresponds to the type of cancellation we considered 
above. Indeed if these are paths io and i\, and they have point (j, m) in common, then 

max(a(^)[«o],a^^''[n]) < m < rmn{a^^+^\iQ\,a^^+^\ii]), (45) 

and the exchange corresponds to transferring the difference a(-'+^^[io] — a'^-^+^^[ii] from Mj^j to Mi^j 
(which leaves both entries > due to the inequality between the outer members of (45)), followed by the 
interchange for all f > j of the entries Mig j> and Mi^ j'. 

Looking back at proposition 2.5 from this path point of view, we see that in its proof there is even 
more freedom in matching cancelling terms than we used: the existence of a common point on paths 
io and ii is all that is needed in order to obtain a cancellation by a transfer between parts iq and ii, while 
we used the stricter requirement /(io) = /(*i)- (To make this clear, recall that the function / was defined 
relative to the multiplication by a single factor hj and under the assumption that a^^^ is a partition, by the 
condition a^^+i' [i] G //(,) , where 7o = { G Z | q(^) [0] < fc } and = { fc G Z | a^^) [i] < fc < a^^) [«-!]} 
for i > 0. Then for ii > io there is a common point with vertical coordinate j on paths io and ii if and 
only if io > f{ii), which is certainly the case if /(ii) = /(io)-) 

This extra freedom does not increase the number of terms that cancel, it just gives more possibilities 
of choosing (io,ii) for those terms that do allow cancellation. Of course, to complete a proof that the 
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second member of (43) reduces to the third member, one must give up this freedom and make a concrete 
choice that defines a sign-reversing involution on the set of cancehing terms. A fairly natural choice to do 
so is to choose the top-leftmost point that lies on more than one path, in other words take j minimal in the 
discussion above and then m = a^^^ [iq] where = max { i \ a^^'^^^ + 1] ^ oi^''^ H }; then ii — iQ + 1, and 
only paths Iq and ii pass through {j,m), which makes it easy to show that after exchange of paths this 
rule finds the same point {j, m) and the same pair (zq, Indeed the fact that we took some pair (iq, ii) 
with J{ii) ~ fik)) in the proof of proposition 2.5 seems a bit artificial in retrospect (regardless of exactly 
which pair was chosen); in particular there may be no such pair with ii = «o + Ij so the common point 
used need not be the first one where path ii meets another path. 

One can proceed analogously for equation (28), comparing it with the description of K'^^^p as 
counting transpose semistandard tableaux of shape A//z and weight /3. We have not defined binary 
encodings of such tableaux T (or, for that matter, integral encodings), but if we take as columns differences 
of successive partitions of T, we obtain a binary matrix M whose transpose is the binary encoding of the 
transpose of T. The situation is in fact quite similar to the one considered above for (27), as we have 
col(M) = /? and row(M) = X — n, whence M contributes +1 to the summation in (28), and all such 
matrices account for the entire value of the summation. Forming a generating series by /3, we obtain 

J2k'xM^''= E £(/^ + row(M),A)X-'W= Yl (^6) 

The first equality is related to the Von-Nagelsbach-Kostka identity, since the first member of the equation 
equals sa»/m»[^n] by (32), and the second member can be obtained by substituting (9) into (40). 

Before we consider the reduction of the second member of (46) to its third member by cancellations, 
we transform the equation slightly by taking transposes of tableaux and matrices, so that we will be 
dealing with ordinary semistandard tableaux and their binary encodings. At the same time we replace 
A//U by A'//x' so that the members of the equation represent SA/^f-'^N] rather than sxt/i^t[Xf^]; we get 

J2 £(m' + col(M), A')^™^^^) = (47) 

MeMPl MeTablPl(A//n) 



cn 



Let M £ AlP' with row(M) = a. Each row Mj represents a monomial X ' in the factor Cq. of 
and the entire matrix represents an occurrence of the monomial X'^°^^^^ in the expansion of e^- Put 
^(») = -\- Mj/ for i e N, then M G Tabl'^'(A//^) means that e V for all i (the remaining 

requirements for the relations ^ are automatically satisfied) and that = A' for i sufficiently 

large (in other words /i* + col(M) = A*). As i increases, M identifies a term s^(i) in the development 
of s^teao • • -^ai-i, until possibly at some point /3^*) ^ V, in which case s^(i) = 0. If this never happens, 
then M also identifies a term s^t_|_(,oi(M) in the expression of the final value s^tCa in the basis of Schur 
functions. In this case M gives equal contributions to both members of (47), which contributions are 
nonzero if and only if /z* + col(M) = A* or equivalently M € Tabl'^'(A//x) (the contributions are zero 
when M € Tabl'^'(A'//i) for some partition A' 7^ A; as before this case needs no special consideration). 

Now suppose to the contrary that one does have s^(i) = for some i, and fix i to the minimal such 
value. We see that the situation is somewhat simpler than before, in that there is no need to modify 
row Mi to find a matching term that accounts for the cancelling of this contribution to s^tea^ ■ ■ ■ eQ._i. 
However the contribution S|J,t_^_ca^M) to s^te^ might still be nonzero, and in order to find a matrix with a 
cancelling contribution (whose rows up to and including Mj will be the same as those of M), one needs 
to choose a column index j that "causes" the vanishing of s^(«). Concretely, the fact that s^«) = 0, 
while /J^'^ was obtained by adding the binary composition A4i to the partition implies that the 

sequence /3*-*^[] has equal adjacent entries at some position (perhaps at several different positions). So 
choose some j for which = f3^^^\j + 1], for instance the minimal one. Then to find a matrix whose 

term cancels that of M, we may interchange the parts of columns j and j + 1 beyond row M^; the 
cancellation is assured by the fact that this modification causes the interchange of the entries at positions 
j and J + 1 of the sequence (/x* + col(M))[]. 

A simple example will make this more concrete. We take A//i = (7, 5,2)/(2) and 



M 



/O 1 1 0^ 

10 10 10 1 

10 10 

Vi 1 1 1 0. 
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so that, a = row(Af) = (2,4, 2,4). One has = m' = (1, 1) and = (1, 1, 1, 1) which is still a parti- 
tion, but P^^^ = (2, 1, 2, 1, 1, 0, 1) is not, and indeed s^(2) — 0. As cause for the vanishing of s^(2) one can 
indicate either of the two repetitions in the sequence = (+1, —1, —1, —3, —4, —6, —6, —8, —9, . . .), 

and interchanging either columns 1 and 2 or columns 5 and 6 beyond row 1 leads to two matrices whose 
contribution could cancel against that of M, namely 



M' 



/O 1 1 0\ 

10 10 10 1 

10 10 

\l 1 1 1 0/ 



and 



M" = 



/O 1 1 0\ 

10 10 10 1 

10 10 

\l 1 1 1/ 



Indeed one has 5(3 3 2.1,3.1,1) = ^S(3. 1,4. 1,3.1,1) = "5(3 3 2,1.3,0.2) (the subscripts were respectively computed 
as /Lt* + col(M), ji* + col(M'), and /i* + col(M")), which shows that the contribution of M to the left 
hand side of (47) could cancel against that of M' or of M"; since e((3, 3, 2, 1, 3, 1, 1), A') = —1, the 
former contribution is in fact and the latter are both The rule suggested above 

of choosing j would pair up M with M' . 

Again we can visualise the cancellation using lattice paths that trace the evolution of the individual 
terms of the sequence /?^'-*[] as i increases. In order that all possible paths (subject only to the condition 
of eventually running in a prescribed direction) can be obtained, each bit Mj should directly control the 
direction of segment i of path j, choosing between two possible directions (note the difference with the 
correspondence above for integral matrices, where Af^.j determined the number of steps to the right on 
path i at horizontal level j). For reasons that will become clear below we choose the possible directions in 
the binary case to be downwards (for bits 0) and leftwards (for bits 1). We want the points on all paths 
that represent terms of the same sequence to line up, and given the directions in which the paths 

progress, this alignment must be along diagonals in 7?. Thus we arrive at the following rule: path j starts 
at the point (— — on the main diagonal, and passes through the point (i — — /3^''[i]), 

which lies i diagonals below the main diagonal, for each i G N. If for j ^ j' paths j and j' have a point 
in common, then this happens for the same value of i, and it means that = AH paths will 

eventually run straight downwards, with path j running along the vertical line (*, — (/i* + col{M))[j]). 
The matrix M will give a nonzero contribution to the left hand side of (47) if and only if those lines 
for J G N are a permutation of the lines (*, — A*[j]), and M will then give a nonzero contribution to the 
right hand side of (47) if and only if the paths remain disjoint, in which case the permutation must be 
the identity, so that /i* + col(Af) ~ A*. Wc illustrate such a family of disjoint paths corresponding to 
the same semistandard tableau T that was used before, together with its binary encoding M. Note that 
columns of T correspond to those of M, and to the paths, all in the same order from left to right. 
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For families of paths that are not disjoint, cancellation can be realised as before by choosing a pair of 
paths that meet, and exchanging their parts beyond the first point of contact; for the matrices this means 
exchanging the lower parts of the corresponding columns. 

Now that we have two families of lattice paths related to any semistandard tableau T, a natural 
question is how the two are related. It turns out that with a slight transformation the two families can 
be aligned so that there is exactly one crossing of members of the two families for every square of [A//x]. 
We show the two families shown above after transformation, first individually and then in superposition. 



20 



4 The Gessel- Viennot correspondences 




We have in fact, for aesthetic reasons, prefixed vertical segments of one unit to all paths of the first 
family, and of half a unit to all paths of the second family. The combinatorial explanation for this nice 
superposition is that, if T is given by the sequence of partitions (A^''^)feeN, then the first family shows the 
evolution of the sets { A^''^ [i] | z S N } as fc traverses N, and the second family shows the evolution of the 
complementary sets { — 1 — (A*^'^'')*[j] | j G N} (the term —1 needed for complementarity was left out of 
the coordinates of the second family for simplicity, but is taken into account in the superposition). Indeed 
one finds for matrices that are integral and binary encodings of a semistandard tableau T = (A^'^-')fegN 
of shape A//i, that in our descriptions above a^^^ = A'-'-', respectively that Z?*^*^ = (A^*^)*. 

For the first family of paths, the values A*^*^-* [i] are most clearly read off at the middle of the vertical 
segment from (fc — 1, A'^'^-' [i]) to {k, \^ while for the second family of paths, the values of — 1 — (A'^'^^)*[j] 
can be read off on a diagonal passing through the vertices; this explains the necessity of half a unit of 
vertical displacement between the triangular lattices that are the images of the lattice under the two 
transformations, in order to superimpose the lines that correspond to the same sequence A^'^^[]. 

This section would not be complete without a visual explanation of the superposition of families 
of paths. The form of path i of the first family describes the evolution of row i, with i fixed, of the 
Young diagrams [A'-'^-'] for A: G N, so it has the same shape as the path along the boundary of a single 
"row" (i.e., subset with first coordinate fixed) of the 3-dimensional "block diagram" corresponding to 
the semistandard tableau, namely { {i,j,k) S N'^ | £ [A*^*^-*] } in which each element is represented 
by a cube. To match the way we drew the lattice paths, the coordinate k must be made to increase 
downwards, and the coordinate j to the bottom right. With the coordinate i increasing to the bottom 
left, we obtain, for the tableau T of our example, the display below. The first picture shows just the 
block diagram, with the top of the "pillars" for the squares of the diagram [/i] — [(4, 1)] omitted, so that 
the visible tops of pillars correspond to the squares of [A//x]. In the second picture we have added the 
paths along the "row boundaries", which trace the boundary of the block diagram at fixed value of i. 




This display has as defects that the relative position of the paths is not correct (so cases where paths 
should have a common point would still appear disjoint), and the paths along "column boundaries" (with 
j fixed, so running from top right to bottom left) do not resemble the second family. Both points can 
be remedied simultaneously, by raising (in the sense of decreasing k) each "row" i (the plane of cubes 
with that value of i) by i units. This amounts to using, instead of the block diagram of the semistandard 
tableau (A''^-')fcgN, the set { {i,j,k — i) \ G [A^*^-*] } C N'^. We show the resulting diagram, and then 
add the two superimposed families of paths. 
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§5. The Littlewood-Richardson rule. 

We shall finally apply lemma 2.2 for f = sx and even for / = sa/«i so as to obtain an expression for 
multiplication by (skew) Schur functions in the basis of Schur functions. As usual, the sport will be to find 
a cancellation of the resulting alternating sum that leaves only positive contributions (from representation 
theoretic interpretations of Schur functions it is clear that the decomposition of the product in the basis of 
Schur functions cannot involve negative coefficients) . We shall thus find a combinatorial description of the 
decomposition of a product sxSfj, into Schur functions that is equivalent to the Littlewood-Richardson rule. 
In fact we shall obtain a generalisation that was given in [Zel] , which describes the scalar product of an 
arbitrary pair of skew Schur functions. Our approach naturally leads to two different formulations of this 
result, one in terms of binary matrices and another in terms of matrices with coefficients in N. Both are 
equivalent to a more traditional formulation in terms of Littlewood-Richardson tableaux, but the relation 
with cancellations in alternating sums gives a perspective somewhat different from for instance the one 
we took in [vLee] , where the Littlewood-Richardson rule was approached via Robinson's correspondence 
and jeu de taquin. We would like to note however that coplactic operations provide a very elegant means 
to unify the two points of view, as we shall indicate in a sequel to this paper. 

As an introduction we shall discuss the question that was evoked after equation (32), of proving 
bijectively that Schur functions are the generating series of semistandard Young tableaux of given shape 
by weight. The suggested proof was to apply lemma 2.2 for /[Xn] — J2TeSST{\) X""^^"^^ and (3 = (0), 
and prove that its right hand side reduces by cancellations to s\. If we pretend not to know already that 
this generating series /[X-ys] is a Schur function, then there is in fact another obligation, namely to prove 
that /[Xn] is a symmetric function, so that lemma 2.2 can be applied in the first place. Proving this 
symmetry can, surprisingly, be done in almost the same way as proving the mentioned cancellation 

X! ■Swt(T) = Sx. (48) 
T6SST(A) 

The symmetry of /[^n] can be established by giving for each m an involution on SST(A) that 
interchanges components m and m + 1 of the weight, which means, in terms of the display of the tableaux 
that the frequencies of the entries m and m -\- 1 must be interchanged, while preserving the shape. Since 
there is no reason to change other entries than m and m + 1, this involution can be viewed as operating 
only on the component shape A'^'"+^'' of tableaux (A'*^).;^^ in SST(A), which shape is constrained by the 
relation A^™' ^ xim+i) ^ x(rn+2) ^j^j^^j^ ^y^^ shapes A*™) and X("'+^) are fixed. 

As for the cancellation (48), the one tableau T in the summation whose term survives the cancellation 
will be the one whose display has only entries i in any row i, and which therefore satisfies wt(T) — A. 
For any other T e SST(A), let i be minimal such that row i of the display of T has some entry other 
than i (by column strictness, such entries must be greater than i), and let the final entry of that row, in 
column J = Ai — 1, have value m + I (so m > i). We shall consider this entry as the one that triggers 
the cancellation of the term for T, and in constructing T' e SST(A) whose terms cancels that of T, we 
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shall leave that entry intact, as well as all rows i' < i, so that the same entry m + 1 also triggers the 
cancellation for T'. In fact, we shall only change entries m and m + 1 in the display of T, if any, with 
the goal of interchanging the values of wt(T)[TO] and of wt (T)[rn + 1]. The entries that may be changed 
are in rows i' > i and in columns j' < j of the display of T, and one can treat the subtableau of entries 
m, m + 1 in that region in isolation: whatever subtableau of entries m, m + 1 should replace them, it 
will be compatible with the entry m + 1 at position Moreover, since that single entry causes the 

contribution to wt(T)[] of the part outside the specified region to have equal values at indices m and m+1, 
the goal must be to interchange the frequencies of entries m and m + 1 in the mentioned subtableau. 
Thus, the problem is basically the same one as for proving symmetry of the generating series /[Xn]- 

The simplest involutions establishing that symmetry are those that were introduced in [BeKn] , and 
which we shall call the Bender-Knuth involutions. They were in fact introduced in the combinatorial 
proof of a lemma that essentially states our equation (48) (lemma 2 of the cited paper), although an 
application to the symmetry of / immediately follows it. The idea is straightforward: one ignores any 
columns containing both entries m and m+1 (such entries cannot be changed), after which all rows can 
be treated independently, and in each of them one separately interchanges the frequencies of m and m+1; 
we refer to the proof of [Stan, theorem 7.10.2] for details. 

A simple example will illustrate these involutions and their application to establishing (48). Take 
the semistandard Young tableau displayed as follows 
























1 


2 


2 


2 


4 


4 


4 


3 


3 


3 


4 


5 


6 


6 


4 


5 


5 


6 


6 






6 


7 


7 











The Bender-Knuth involution that interchanges the components 5 and 6 of the weight, operates separately 
on the singleton sequence [6] in row 4, on the sequence [5,5,6] in columns 1,2,3 of row 3, and on the 
sequence [6,6] at the end of row 2 (the entries 5 and 6 in column 4 are frozen); the sequences are 
respectively turned into [5], [5,6,6] and [5,5]. As for the cancellation (48), it is triggered by the final 
entry 4 of row 1, and therefore operates on entries with values 3 and 4; that final entry does not participate 
in the exchange, nor do the frozen entries in column 0, so [3, 3, 4] in row 2 and the initial [4, 4] in row 1 
are respectively changed into [3,4,4] and [3,3]. The tableaux resulting for these two operations are 
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One sees that the weights of these tableaux have the required relation to the weight a = (7, 1, 3, 3, 5, 3, 5, 2) 
of the initial tableau, notably the weight a' = (7, 1, 3, 4, 4, 3, 5, 2) of the final tableau satisfies Sa + Sa' = 0. 

Now let us turn to more useful applications of lemma 2.2, for which we do not know the result 
beforehand. With respect to the application in the previous example one could replace the choice /? = (0) 
by one taking for /? an arbitrary partition /x, which results in a formula for products s\s^ of Schur 
functions, or one could replace the choice f = s\hy the more general / = s>,/„, resulting in a formula for 
decomposing skew Schur functions into ordinary ones. In view of the relation ^ .sa^;,, | s,y ) = ( | Sv/fj. ) 
the two problems are equivalent, but it will be interesting to perform both generalisations at once, 
which in fact produces no additional complications. So we apply lemma 2.2 with / = Sa/k (and hence 
/[^n] = ETessT(A/«) X^(^^) and /3 = M e T', which gives 

SA/kSa* = SM+wt(T)- (49) 

TeSST(A/K) 

We can obtain a somewhat more symmetric formulation by taking coefficients of some Schur function 
on both sides, which amounts to taking the scalar product with it; using ( sa/kS^ I ) ~ ( -^a/k | Si^//* ) 
this gives 

( SA/« I s./^* ) = Yl ff(M + wt(T),i/). (50) 

TeSST(A/K) 
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Wc shall now describe a cancellation that can be applied to the right hand side of (49), which 
was proposed in [Stem]; in fact a rather similar cancellation argument, but using a somewhat different 
terminology, can already be found in [BeZe, Theorem 4] . Contrary to what we saw for (48) there will in 
general be more than one term that survives the cancellation, but for those tableaux T whose term does 
cancel, an entry m + 1 in its display will be indicated that triggers the cancellation, and a tableau T' 
whose terms cancels that of T will be constructed by possibly modifying entries m and m + 1, in much 
the same way as before. In particular all changes will be strictly to the left and weakly below the entry 
triggering the cancellation. 

The condition for cancellation is stated in terms of the weights of individual columns of T. Define the 
weight cwtj(T) of column j of T to be the binary composition a whose part ai 6 {0, 1} tells whether or 
not there is an entry i in column j of the display of T, in other words, cwtj (T) is the column (M')j of the 
binary encoding M' of T. For later use also define the weight rwti(T) of row i of T as the composition (3 
whose part [3j gives the number of entries j in row i of the display of T; thus rwti(T) equals row Mj of 
the integral encoding M of T. One obviously has wt(T) = '^^^A'^) — SieN i"wtj(T). 

The terms in the right hand side of (49) that will survive the cancellation are those for the tableaux T 
with the property that for alH G N one has /i + X]j>i cwtj(T) G V: in particular fi + wt(T) is a partition, 
so that the term for such T is already normalised. So once the cancellation is established, it will prove 

sx/^s^= J2 [V?GN:/x + ^cwt,(T)eP]s^+wt(T), (51) 

TeSST(A/K) j>l 

which is a generalisation of the Littlewood-Richardson rule. The tableaux that satisfy the condition are 
those that we termed /z-dominant in [vLce, §1.4] (although it requires a bit of work to see this), and 
for fi = (0) these arc the traditional Littlewood-Richardson tableaux; as one can see, these describe the 
decomposition of Sx/k in the basis of Schur functions. By taking instead k, = (0), one obtains a description 
of the decomposition of s^s^ in that basis, in terms of /x-dominant tableaux of shape A. 

For any T that fails the condition in (51), there is a maximal I such that n + '^j>i cwtj{T) ^ V; 
we fix this I and put a = n + X]j>i '^'^^^(T). Since a ^ V while /x + J2j>i cwtj(T) = a — cwti{T) G P, 
and cwt;(T) is a binary composition, it must be that there exist some m with am+i — otm + 1, which 
can also be written as a[m] = a[m + 1]. We choose the minimal such m, which then necessarily also has 
cwt;(r)TO = and cwt;(T)m+i = 1, so column / of the display of T contains no entry m, but it does 
contain an entry m + 1. That entry will be the one that is considered to trigger the cancellation of the 
term for T. 

Since the entries in columns j > I of the display of T are involved in triggering the cancellation 
and determining the values of I and m, they will not be altered during the construction of a tableau T' 
whose term cancels that of T; this will ensure that the cancellation is triggered identically for T' as it 
was for T. As before the subtableau of entries m, m + 1 in columns j < I can be treated in isolation, 
since the frozen entry m + 1 in column I imposes no constraint. One has ^ + wt(T) = a + J2j<i cwtj(T) 
and for T' one will similarly have /U + wt(T') = a + "^j^i cwtj(r'); since a[m] = a[m + 1], the desired 
cancellation s^+wt(T) + S/:i+wt(T') = can be assured by having cwtj(T') differ from J2j<i cwtj(T) 
by the interchange of the parts indexed by m and m+1. Therefore T' can be obtained from T by applying 
the Bcndcr-Kmith involution for m, m + 1 to the subtableau in columns j < I. 

As an example consider the tableau T of (2) with /z = (7,6,5,4,2); one finds I = 5, for which 
a = n+ {0,0, 1, 0, 1, 2, 3) = (7, 6, 6, 4, 3, 2, 3) and m = 5, and the entry m+ 1 = 6 at position (1, 5) triggers 
the cancellation. The tableau T' is found by applying the Bender-Knuth involution for interchanging 
5 and 6 to the subtableau formed by the 5 leftmost columns of T, which transforms 
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Indeed one has s^+wt(T) + S/i+wt(T') = S(9,9,8,6,6,4,7) + •8(9,9,8,6,6,6,5) = 0- 

The condition for yu-dominance that appears in (51) may appear somewhat strange at first sight, 
but it has a remarkable resemblance to the condition that characterises binary encodings of tableaux. To 
bring out this resemblance, we shall replace the tableau T in the summation by its binary encoding M, 
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so that Mj will replace cwtj(r), and row(Af) will replace wt(T). Also we shall apply our cancellations 
to the right hand side of (50) rather than of (49). One has col(M) = A* - k* for any M e TabP'(A/K), 
and those M giving nonzero terms after cancellation satisfy row(M) = v — ji. Spelling out explicitly the 
condition that M G Tabl'^'(A/K), we therefore obtain the following statement, equivalent to (51). 

5.1. Theorem, (generalised Littlewood-Richardson rule, binary formulation) For all k, \n,y& V: 

There is another formulation of the condition of /x-dominance than the one that appears in (51), 
which uses row weights rather than column weights, and which is actually closer to the way the notion 
was defined in [vLee] . One can determine independently for each value of m whether any entry m + 1 
in the display of T is a candidate for triggering cancellation. Although in case of multiple candidates, 
only the topmost one in the rightmost column containing any candidates will be selected, that rule has 
no influence on the set of tableaux that will survive cancellation, namely those for which no candidates 
exist at all. In order for an entry m + 1 in column I to be candidate for triggering cancellation, one must 
have a[m] = a[m + 1] for a = i^ + ^jyi cwtj{T), and I must be the largest value for which this condition 
holds. Then there will not be any entry m above that entry m + 1 in the same column I of T, and this 
means that if the entry m + 1 is in row k, one also has a'[m] = a'[m + 1] for a' = + J2i<:k i'wti(T) + (3, 
where (3 is the weight of the part of row fc of T consisting of the candidate entry and all entries to its 
right. 

Conversely any entry m+1 for which this condition in terms of a' holds, and for which k is minimal, 
cannot have an entry m in the same column, and is therefore a candidate for triggering cancellation. 
Then, assuming there arc no candidate entries m, + 1 in rows i < k, one may test if there is any such 
entry m + 1 in row k, by considering a" = + Ylii<k rwt,(T) + (}' where /?' the weight of the part of 
row k consisting of entries exceeding m (in other words is obtained from rwtfe(M) by clearing its 
parts at indices < to): there will be such a candidate if and only if a'^a+i > '^m- This use of weights 
of partial rows can be avoided in the formulation, by comparing parts of two different compositions: if 
one defines a*^*°' = /U + X]i<io i'wti(r) for iq G N, then the final condition a'^^i > a'^ is equivalent to 

< Q!^+i''. For instance in our example above, the fact that some entry 6 in row 1 is a candidate for 
triggering cancellation, can be deduced from < where a = ;U+ (1, 0, 1, 0, 1, 2) = (8,6,6,4,3,2) 
and a^^^ = a^^^ + (1, 1, 0, 1, 1, 0, 3) — (9, 7, 6, 5, 4, 2, 3) (only the underlined entries are relevant). 

Now let us return to formulating /i-dominanco, the condition stating that the term for T is not 
cancelled at all. This means that the condition just stated fails for all k and for all to. The failure for 
given k and for all to simply becomes a^''^ ^ a^*^+^', which means that we can reformulate (51) as follows: 

SA/«S/.= J2 [V/ceN:^ + ^rwti(T)^/i + ^rwti(r)]s^+^t(T). (53) 

TeSST(A/K) i<k i<k 

Note that, somewhat surprisingly, the sequence (Q;^*-')ieN of compositions actually is a semistandard 
tableau when T survives the cancellation; its shape is + wt(T)//i, and in the terminology of [vLee] 
it is the companion tableau of T witnessing the /x-dominancc of T. If this time we replace T in the 
summation by its integral encoding, we obtain a symmetry in the formulation that is even more striking 
than the one in theorem 5.1. Again we take (50) as a starting point, and obtain the following statement, 
which is equivalent to (53), and therefore to (51), like the previous theorem. This time the condition 
that selects the matrices whose term survives the cancellation is exactly like the condition for being an 
integral encoding in the first place, except for the interchange of rows and columns; therefore we add a 
succinct reformulation of the conditions, using the notation introduced in definition 1.2.3. 

5.2. Theorem, (generalised Littlewood-Richardson rule, integral formulation) For all k. A, /i, e "P 

( ^\/k I ^v/n ) 

^ [V^:K + ^Mj^« + ^M;][Vfc:M + ^Mi^M + E^il 

MeMx-^,t,-u, j<l j<l i<k i<k 

= ^ [ M e Tabl(A/K) ] [ M' G Tabl(!//M) ] . □ 
MeM 
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Wc have completed the description of our final application of lemma 2.2, and of the result of applying 
cancellations to the expression it yields. There is however one more variation that we would like to present. 
In theorems 5.1 and 5.2 we have given expressions for ( 8^/,^ \ s^ly. ) in terms of matrices that treat rows 
and columns similarly. But our starting point for these expressions, equation (50), does not exhibit such 
a symmetry when we replace the semistandard tableaux in the summation by their binary or integral 
encoding matrices. We would like to give expressions involving alternating sums with the same kind of 
symmetry as the mentioned theorems, and from which those theorems can be obtained by cancellations. 
This will require that we proceed backwards, introducing a second layer of signs by a kind of inverse 
cancellation process. In fact we shall use equations (27) and (28) to translate the Kostka numbers, which 
are implicit in the summation over semistandard tableaux, into alternating sums over matrices. 

Concretely, we write the right hand side of (50) as X^^gc -^a/k,/3£(m + /?) ^)) a^^d then we can apply 
(27), which gives 

< SA/« I s^i^ ) = e{K + row(M), \)e{n + col(M), v), (54) 
or we can apply (28) to Kx/k,0 = -f^A'/re' ,9' which gives (transposing the binary matrices M) 

( sa/« I s,/^ ) = ^ e(K* + col(M), A')£(/i + row(M), v). (55) 

It is clear from the way we have obtained these equations, that one can go from them to theorems 
5.2 and 5.1, respectively, by applying two successive phases of cancellation. The first phase is a Gessel- 
Viennot cancellation of terms with opposite values of £(K + row(M), A) respectively of e(K* + col(Af), A*), 
as in (43) and (47), which gives us back (50) with tableaux replaced by their integral or binary encodings. 
The second cancellation phase cancels within the remaining terms pairs with opposite values of the 
remaining factor £(/i + co\{M),v), using the cancellation based on the Bender-Knuth involutions that 
was given following (51), translated in terms of integral or binary encodings. 

It is interesting to compare the way in which these two cancellation phases proceed. The Gessel- 
Viennot cancellations search for a failure of the condition Ad e Tabl(A/K) respectively M G TabP'(A*/K*) 
by successively computing the compositions a*-'^ that would define the shapes (or their transposes in the 
binary case) forming the semistandard tableau encoded by M if one exists; as soon as one finds a^'^ x/- 
Q,(«+i) respectively a^'^ ^ a(*+^\ the cancellation of the term for M is detected, and the construction 
of a cancelling matrix M' starts. Up to this point the second cancellation phase proceeds in quite the 
same way, differing only in whether weights of rows or columns of M are successively added to the 
compositions a^^\ and in the fact that for the binary case these cohmms are taken in the opposite order. 

But after detecting cancellation and selecting a pair of indices responsible for it, the Gessel-Viennot 
cancellations construct M' from M by simply interchanging entries for those indices in the rows or 
columns that had not yet been considered; in the integral case there is also a transfer at those indices 
in the column in which cancellation was detected itself, which can be interpreted as interchange of those 
parts of the values of the entries concerned that remain after the cancellation has been triggered (the 
parts of their paths after the first point of contact). Such a simple rule would not work in the Bender- 
Knuth case: in the binary case weak increase along rows would be violated, while in the integral case 
strict increase in columns would be violated (in either case this is the only one of the two conditions that 
can be violated, as the other one is assured by the way tableaux are encoded by matrices). 

What the Bender-Knuth involutions actually do in terms of integral and binary encodings is not 
easily described directly, although one can say in general that the transfers between entries are more 
limited than what they would be for a Gessel-Viennot cancellation. This can be seen by considering the 
encodings of the tableaux T, T' in (52). The changes only involve entries 5 and 6, so we just show the 
changes to columns 5 and 6 of the integral encoding and to rows 5 and 6 of the binary encoding: 

1 1 1 1' 
11111110^ 
and ' I ' . (56) 

110 110 11' 
10 11110^ 

The entries where cancellation is triggered have been underlined (in the integral case it is the final unit of 
the entry 3, i.e., a value 2 at the same position would not trigger cancellation). One sees that the parts of 
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the cohinins below that entry (in the integral case) or of the rows to the left of that entry (in the binary 
case) have not been completely interchanged. The precise changes depend on the shape A*^^) = (7, 5, 4, 3) 
in the tableau T to which the entries 5 in its display are added, and in terms of the matrices this shape 
depends both on fi and on the columns to the left of those that were shown, respectively the rows above 
those that were shown. 

One may wonder, both for (54) and for (55), if it is possible to find a pair of cancellations that 
reduces them to theorem 5.2 respectively to theorem 5.1, in a manner that better respects the symmetry 
present both in the initial and in the final expressions. In particular one would like to be able to apply 
the two cancellations in either order, with the cancellation being applied last operating only on the 
terms that survived the first cancellation. This requires that either cancellation respects the condition 
expressing survival under the other cancellation: it will never cancel a survivor of the other cancellation 
against a non-survivor. Yet while the involutions defining such cancellations should allow restriction to 
the survivors of the other cancellation, their definition should not be limited to those survivors (as is the 
case for the Bender-Knuth involutions, which are only defined for semistandard tableaux): they should be 
as general as the Gessel-Viennot cancellations. We shall show in a forthcoming sequel to this paper that 
both for (54) and for (55) such a pair of cancellations does indeed exist. Moreover the operations used 
to define these cancellations are quite interesting in their own right, and they give rise to ramifications 
that relate to Knuth correspondences and to jeu de taquin. 
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